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Preface 


Probability theory had its beginnings in the early seventeenth century as a result of 
investigations of various games of chance. Since then many leading mathematicians and 
scientists made contributions to the theory of probability. However, despite its long and 
active history, probability theory was not axiomatized until the twenties and thirties of 
this century. This axiomatic development, called modern probability theory, was now able 
to make the concepts of probability precise and place them on a firm mathematical 
foundation. 

The importance of probability has increased enormously in recent years. Today the 
notions of probability and its twin subject statistics appear in almost every discipline, e.g. 
physics, chemistry, biology, medicine, psychology, sociology, political science, education, 
economics, business, operations research, and all fields of engineering. 

This book is designed for an introductory course in probability with high school algebra 
as the only prerequisite. It can serve as a text for such a course, or as a supplement to all 
current comparable texts. The book should also prove useful as a supplement to texts and 
courses in statistics. Furthermore, as the book is complete and self-contained it can easily 
be used for self-study. 

The book begins with a chapter on sets and their operations, and follows with a chapter 
on permutations, combinations and other techniques of counting. Next is a chapter on 
probability spaces and then a chapter on conditional probability and independence. The 
fifth and main chapter is on random variables. Here we define expectation, variance and 
standard deviation, and prove TchebychefPs inequality and the law of large numbers. 
Although calculus is not a prerequisite, both discrete and continuous random variables are 
considered. We follow with a separate chapter on the binomial, normal and Poisson 
distributions. Here the central limit theorem is given in the context of the normal approxi¬ 
mation to the binomial distribution. The seventh and last chapter offers a thorough ele¬ 
mentary treatment of Markov chains with applications. 

Each chapter begins with clear statements of pertinent definitions, principles and 
theorems together with illustrative and other descriptive material. This is followed by 
graded sets of solved and supplementary problems. The solved problems serve to illustrate 
and amplify the theory, bring into sharp focus those fine points without which the student 
continually feels himself on unsafe ground, and provide the repetition of basic principles so 
vital to effective learning. Proofs of most of the theorems are included among the solved 
problems. The supplementary problems serve as a complete review of the material of each 
chapter. 

I wish to thank Dr. Martin Silverstein for his invaluable suggestions and critical review 
of the manuscript. I also wish to express my appreciation to Daniel Schaum and Nicola 
Monti for their excellent cooperation. 

Seymour Lxpschutz 

Temple University 
November, 1968 
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Chapter 1 


Set Theory 


INTRODUCTION 


This chapter treats some of the elementary ideas and concepts of 
necessary for a modern introduction to probability theory. 


set theory which are 


SETS, ELEMENTS 

An ,y well defined list or collection of objects is called a set ; the objects comprising the 
set are called its elements or members. We write 

PGA if p is an element in the set A 

s slid to 1S h° *2 3 7 B ’ ^ if p G A implies V&B ’ then A is called a 

suoset ot B or is said to be contained m B; this is denoted by 

A C B or B D A 

Two sets are equal if each is contained in the other; that is, 

A—B if and only if A c B and B c A 

The negations of p e A, A cB and A = B are written pdA, A <tB and A¥=B respectively. 

We specify a particular set by either listing its elements or by stating properties which 
characterize the elements of the set. For example, Properties which 

A = {1,3, 5, 7, 9} 

means A is the set consisting of the numbers 1, 3, 5, 7 and 9; and 

B = {x : x is a prime number, x < 15} 
means that B is the set of prime numbers less than 15. 

fixed^etclltdTh^r Stated ; al ! Set ® under investigation are assumed to be subsets of some 

the emptTor77 7°^ ^ ^ Chapter) by U ' We also use 0 to denote 

of every other ft Tht frfr 8 whl f ch /1 Conta “ s no dements; this set is regarded as a subset 
very otner set. lhus for any set A, we have 0cAc£7. 

Example 1.1. The sets A and B above can also be written as 

A = {x : x is an odd number, x < 10} and B - {2, 3, 5, 7, U f 13 } 
??r V and h 6 at S5 eA ^ 11 e * but H«A; whereas 3GA and 3GB, and 
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Example 1.2: We use the following special symbols: 

N = the set of positive integers: 1, 2, 3, ... 

Z = the set of integers: ..—2, —1, 0, 1, 2, ... 

R = the set of real numbers. 

Thus we have NcZcR, 

Example 1.3: Intervals on the real line, defined below, appear very often in mathematics. Here 
a and b are real numbers with a < b. 

Open interval from a to b = (a, b) = {% : a < x < b} 

Closed interval from a to & = [a t b] = {x : a — x — b} 

Open-closed interval from a to 6 = (a,b] = {x : a < x — 6} 

Closed-open interval from a to b — [a, b) = {x : a — x < b} 

The open-closed and closed-open intervals are also called half-open intervals. 

Example 1.4: In human population studies, the universal set consists of all the people in the 

world. 

Example 1.5: Let C = {x : a; 2 = 4, x is odd}. Then C = 0; that is, C is the empty set. 

The following theorem applies. 

Theorem 1.1: Let A, B and C be any sets. Then: (i) Ac A; (ii) if AcB and BcA then 
A = B; and (iii) if AcB and BcC then AcC. 

We emphasize that AcB does not exclude the possibility that A — B . However, if 
A c B but A¥*B, then we say that A is a proper subset of B. (Some authors use the symbol 
C for a subset and the symbol C only for a proper subset.) 


SET OPERATIONS 

Let A and B be arbitrary sets. The union of A and B, denoted by AuB, is the set of 
elements which belong to A or to B: 

A L) B = {x : x G A or x G B} 

Here “or” is used in the sense of and/or. 


The intersection of A and B, denoted by AC)B, is the set of elements which belong to 


both A and B : 


A n B = {x : x G A and x G B) 


If AnZ? = 0, that is, if A and B do not have any elements in common, then A and B 
are said to be disjoint. 


The difference of A and B or the relative complement of B with respect to A, denoted by 
A \ B, is the set of elements which belong to A but not to B : 

A\B — {x : x G A, x & B} 

Observe that A\B and B are disjoint, i.e. (A\B) n B = 0. 

The absolute complement or, simply, complement of A, denoted by A c , is the set of 
elements which do not belong to A: 

A c = {x : x G U, x & A) 

That is, A c is the difference of the universal set U and A. 
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Example 1»6: The following diagrams, called Venn diagrams, illustrate the above set operations. 

Here sets are represented by simple plane areas and U, the universal set, by 
the area in the entire rectangle. 



A\B is shaded. A c is shaded. 


Example 1.7: Let A = { 1,2,3,4} and £ = {3,4,5,6} where U = { 1,2,3,...}. Then 

AuB = {1,2,3,4,B,6} AnB = {3,4} 

A\B = {1,2} A c = {5,6,7, ...} 

Sets under the above operations satisfy various laws or identities which are listed 
in the table below (Table 1). In fact, we state 

Theorem 1.2: Sets satisfy the laws in Table 1. 



Table 1 
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Remark: Each of the above laws follows from an analogous logical law. For example, 

A C\ B = {x : x G A and x G B) = {x : x G B and x G A} = B f\ A 

Here we use the fact that the composite statement “p and q”, written p a q, is logically 
equivalent to the composite statement “q and p”, i.e. q a p. 

The relationship between set inclusion and the above set operations follows: 

Theorem 1.3: Each of the following conditions is equivalent to AaB: 

(i) AnB = A (iii) B C CA C (v) BuA c = U 

(ii) AuB = B (iv) AnB c = 0 


FINITE AND COUNTABLE SETS 

Sets can be finite or infinite. A set is finite if it is empty or if it consists of exactly n 
elements where n is a positive integer; otherwise it is infinite. 


Example 1.8: Let M be the set of the days of the week; that is, 

M — {Monday, Tuesday, Wednesday, Thursday, Friday, Saturday, Sunday} 
Then M is finite. 


Example 1.9: Let P = {x : x is a river on the earth}. Although it may be difficult to count the 

number of rivers on the earth, P is a finite set. 

Example 1.10: Let Y be the set of (positive) even integers, i.e. Y ~ {2,4,6, . ..}. Then Y is an 
infinite set. 


Example 1.11: Let I be the unit interval of real numbers, i.e. I — {x : 0 - x - 1}. Then I is 
also an infinite set. 


A set is countable if it is finite or if its elements can be arranged in the form of a 
sequence, in which case it is said to be countably infinite; otherwise the set is uncountable. 
The set in Example 1.10 is countably infinite, whereas it can be shown that the set in 
Example 1.11 is uncountable. 


PRODUCT SETS 

Let A and B be two sets. The product set of A and B, denoted by Ax B, consists of all 
ordered pairs (a, b) where a G A and b G B: 

Ax B = {(a, b) : a G A, b G Bj 

The product of a set with itself, say Ax A, is denoted by A 2 . 

Example 1.12: The reader is familiar with the cartesian plane R 2 = R X R as shown below. 

Here each point P represents an ordered pair (a, b) of real numbers, and vice versa. 



-2 

p 

III 0 

■1 


-3 -2 -1 

1 2 a 3 

-1 

-2 


Example 1.13: Let A = {1,2,3} and B = {a,b}. Then 

A X B = {(1, a), (1, 6), (2, a), (2, 6), (3, a), (3, b)} 
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The concept of product set is extended to any finite number of sets in a natural way. 
The product set of the sets A lf A 2 , ...,A m , written A x x A 2 x • * • x A m , is the set of all 
ordered ^-tuples (a u a 2y .. ., a m ) where a* G Ai for each i . 


CLASSES OF SETS 

Frequently the members of a set are sets themselves. For example, each line in a set 
of lines is a set of points. To help clarify these situations, we usually use the word class 
or family for such a set. The words subclass and subfamily have meanings analogous to 
subset. 


Example 1.14: The members of the class {{2,3}, {2}, {5,6}} are the sets {2,3}, {2} and {5,6}. 

Example 1.15: Consider any set A. The power set of A, denoted by c P{A) i is the class of all sub¬ 
sets of A. In particular, if A = {a , b, c}, then 

*(A) = {A,{a,b} f {a,c},{b,c},{a},{b},{c},0} 

In general, if A is finite and has n elements, then *P(A) will have 2 W elements. 

A partition of a set X is a subdivision of X into nonempty subsets which are disjoint and 
whose union is X y i.e. is a class of nonempty subsets of X such that each a G X belongs to 
a unique subset. The subsets in a partition are called cells . 

Example 1.16: Consider the following classes of subsets of 1 = {1,2, .. .,8,9}: 

(i) [{1,3,5}, {2,6}, {4,8,9}] 

(ii) [{1,3,5}, {2, 4, 6, 8}, {5,7,9}] 

(iii) [{1,3,5}, {2, 4, 6, 8}, {7,9}] 

Then (i) is not a partition of X since 7 E X but 7 does not belong to any of the cells. 
Furthermore, (ii) is not a partition of X since 5 GX and 5 belongs to both {1,3,5} 
and {5,7,9}. On the other hand, (iii) is a partition of X since each element of X 
belongs to exactly one cell. 

When we speak of an indexed class of sets [A { : i G1} or simply {Ai}, we mean that there 
is a set Ai assigned to each element i G I. The set I is called the indexing set and the sets 
Ai are said to be indexed by /. When the indexing set is the set N of positive integers, the 
indexed class {A lf A 2 , . . .} is called a sequence of sets. By the union of these A if denoted 
by U iGI Ai (or simply Ui Ai), we mean the set of elements each belonging to at least one of 
the Ai, and by the intersection of the A iy denoted by ru € j Ai (or simply Hi Ai), we mean the 
set of elements each belonging to every At. We also write 

UjLi Ai = AiUA 2 U-** and Pi T=i Ai — AinA 2 n*** 
for the union and intersection, respectively, of a sequence of sets. 

Definition: A nonempty class cA of subsets of U is called an algebra ( a-algebra ) of sets if: 

(i) the complement of any set in cA belongs to c A ; and 

(ii) the union of any finite (countable) number of sets in c A belongs to c A ; 
that is, if oA is closed under complements and finite (countable) unions. 

It is simple to show (Problem 1.30) that an algebra (cr-algebra) of sets contains U and 
0 and is also closed under finite (countable) intersections. 
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Solved Problems 

SETS, ELEMENTS, SUBSETS 

1.1. Let A = {x:3x = 6}. Does A = 2? 

A is the set which consists of the single element 2, that is, A = {2}. The number 2 belongs 
to A; it does not equal A. There is a basic difference between an element p and the singleton 
set {p}. 

1.2. Which of these sets are equal: {r, s, t}, {t, s, r}, {s, r, t }, {t, r, s} ? 

They are all equal. Order does not change a set. 

1.3. Determine whether or not each set is the null set: 

(i) X = {x : x 2 = 9, 2x- 4}, (ii) Y = {x : x ¥> x}, (iii) Z = {x : x + 8 = 8}. 

(i) There is no number which satisfies both x 2 = 9 and 2x = 4; hence X is empty, i.e. X = 0. 

(ii) We interpret “ = ” to mean “is identical with” and so Y is also empty. In fact, some texts define 
the empty set as follows: 0 = {x : x # #}. 

(iii) The number zero satisfies x 4- 8 = 8; hence Z — {0}. Accordingly, Z is not the empty set 
since it contains 0. That is, Z ¥* 0. 

1.4. Prove that A = {2,3,4,5} is not a subset of B = {x : x is even}. 

It is necessary to show that at least one element in A does not belong to B . Now 3GA and, 
since B consists of even numbers, 3 &B\ hence A is not a subset of B . 

1.5. Let V = { d }, W = {c, d}, X = {a, b, c}, Y = {a, b} and Z = {a, 6, d}. Determine 
whether each statement is true or false: 

(i) YcX, (ii) W * Z, (iii) Z D V, (iv) VcX, (v) X = W, (vi) W c Y. 

(i) Since each element in Y is a member of X> Y C X is true. 

(ii) Now a^Z but a & W; hence W t 6 Z is true. 

(iii) The only element in V is d and it also belongs to Z; hence Z D V is true. 

(iv) V is not a subset of X since d G V but d £ X; hence V cX is false. 

(v) Now aGJ but a € W; hence X = W is false. 

(vi) W is not a subset of Y since c £ W but c & Y ; hence W c Y is false. 

1.6. Prove: If A is a subset of the empty set 0, then A = 0. 

The null set 0 is a subset of every set; in particular, 0 c A. But, by hypothesis, A C 0; 
hence A — 0. 

1.7. Prove Theorem 1.1 (iii): If A c B and B C C, then AcC. 

We must show that each element in A also belongs to C. Let x £ A, Now A CB implies 
x £ B. But B C C; hence x £ C. We have shown that a; G A implies x £ C, that is, that AcC. 

1.8. Which of the following sets are finite? 

(i) The months of the year. (iv) The set Q of rational numbers. 

(ii) {1,2,3, ..., 99,100}. (v) The set R of real numbers. 

(iii) The number of people living on the earth. 

The first three sets are finite; the last two are infinite. (It can be shown that Q is countable 
but R is uncountable.) 
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1.9. Consider the following sets of figures in the Euclidean plane: 

A = {x : x is a quadrilateral} C = {x : x is a rhombus} 

B = {x : x is a rectangle} D = {x : x is a square} 

Determine which sets are proper subsets of any of the others. 

Since a square has 4 right angles it is a rectangle, since it has 4 equal sides it is a rhombus, 
and since it has 4 sides it is a quadrilateral. Thus 

Be A, BeB and D c C 

that is, D is a subset of the other three. Also, since there are examples of rectangles, rhombuses and 
quadrilaterals which are not squares, D is a proper subset of the other three. 

In a similar manner we see that B is a proper subset of A and C is a proper subset of A. There 
are no other relations among the sets. 


1.10. Determine which of the following sets are equal: 0, f0), {0}. 

Each is different from the other. The set {0} contains one element, the number zero. The 
set 0 contains no elements; it is the empty set. The set {0} also contains one element, the null set. 


SET OPERATIONS 

1.11. Let Z7 = {1,2, ..., 8, 9}, A = {1,2,3,4}, B={2,4,6,8} and C = (3,4,5,6}. Find: 
(i) A c , (ii) AnC, (iii) (AnC) c , (iv) AuB, (v) B\C. 

(i) A c consists of the elements in U that are not in A; hence A c — {5,6, 7,8,9}. 

(ii) AnC consists of the elements in both A and C; hence AnC — {3,4}. 

(iii) (AnC) c consists of the elements in U that are not in AnC. Now by (ii), AnC = {3,4} and 
so (A n C) c = {1,2,5,6,7,8,9}. 

(iv) A UB consists of the elements in A or B (or both): hence AuB = {1,2, 3, 4, 6, 8}. 

(v) B\C consists of the elements in B which are not in C; hence B\C = { 2, 8). 

1.12. Let U = (a, 6, c, d, e}, A — {a,b,d} and B — {b, d, e}. Find: 

(i) A US (iii) B c (v) A C DB (vii) A c nB c (ix) (AnB) c 

(ii) Bn A (iv) B\A (vi) AuB c (viii) B C \A C (x) (AUB) C 

(i) The union of A and B consists of the elements in A or in B (or both); hence A Uf? = {a, 6, d, e}. 

(ii) The intersection of A and B consists of those elements which belong to both A and B; hence 
AnB = {6,d}. 

(iii) The complement of B consists of the letters in U but not in B\ hence B c = {a, c}. 

(iv) The difference B\A consists of the elements of B which do not belong to A; hence 
B\A={e}. 

(v) A c = {c,e} and B = {b,d, e}; then A c nB = {e}. 

(vi) A = {a, b y d) and B c = {a, c}; then AuB c = {a, 6, c, d}. 

(vii) and (viii). A c — {c, e } and Be = {a, c}; then 

A c nBc = {c> and B°\A C = {a} 

(ix) From (ii), AnB ~ {b,d}; hence (A nB) c = {a, c, e}. 

(x) From (i), A Uf? = {a, 6, d, e)\ hence (AuB) c “{c). 
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1.13. In the Venn diagram below, shade: (i) B c , (ii) (AuB) c , (iii) ( B\A) C , (iv) A C C\B C . 



(i) Be consists of the elements which do not belong to B; hence shade the area outside B as follows: 



(ii) First shade AvB; then (A U By is the area outside AuB: 




(iii) First shade B\A, the area in B which does not lie in A; then (B\A) C is the area outside 
B\A: 




(iv) First shade A c , the area outside of A, with strokes slanting upward to the right (////), and 
then shade B c with strokes slanting downward to the right (\^\); then A c nB c is the 
cross-hatched area: 



A c and B c are shaded. 



Observe that ( AuB) c — A c nB c , as expected by DeMorgan’s law. 
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1.14. Prove: B\A — BnA c . Thus the set operation of difference can be written in 
terms of the operations of intersection and complementation. 

B\A = {x : x G B, x & A} = {x : x GB, x G A c } = B n A c 

1.15. Prove: For any sets A and B, AnB d A c AUB. 

A ? eAnB > then x d A and x G B. In particular, x G A. Since xGAnB implies xGA 

ADB C A. Furthermore if xG A, then x G A or x G B f i.e. x G AuB. Hence A C A U B. In other 
words, AnB c A c AuB. 


1.16. Prove Theorem 1.3(i): A C B if and only if AnB ~ A. 

Suppose AcB. Let x G A; then by hypothesis, x G B. Hence x G A and x G B y i.e. x G AnB. 

ccordmgly, A c AnB. On the other hand, it is always true (Problem 1.15) that AnB c A 
Thus AnB = A. 


Now suppose that AnB — A. Then in particular, A C AnB. 
AnB c B. Thus A c AnB c B and so, by Theorem 1.1, AcB. 


But it is always true that 


PRODUCT SETS 

1.17. Let M = {Tom, Marc, Erik} and W = {Audrey, Betty}. Find MxW. 

M XW consists of all ordered pairs (a, b) where and b € W. Hence 

M X W = {(Tom, Audrey), (Tom, Betty), (Marc, Audrey), 

(Marc, Betty), (Erik, Audrey), (Erik, Betty)} 

1.18. Let A = {1,2,3}, £ = {2,4} and C= {3,4,5}. Find A X B X C. 

A convenient method of finding A X B X C is through the so-called “tree diagram” shown below: 



(1, 2, 3) 
(1, 2, 4) 
(1, 2, 5) 

(1, 4, 3) 
(1. 4, 4) 
(1, 4, 5) 

(2, 2, 3) 
(2, 2, 4) 
(2, 2, 5) 

(2, 4, 3) 
(2, 4, 4) 
(2, 4, 5) 

(3, 2, 3) 
(3, 2, 4) 
(3, 2, 5) 

(3, 4, 3) 
(3, 4, 4) 
(3, 4, 5) 


The “tree” is constructed from the left to 
listed to the right of the “tree”. 


the right. A X B X C consists of the ordered triples 


1.19. Let A = {a, b}, B = {2,3} and C= {3,4}. Find: 

(i) A X (B U C), (ii) (A X B) U (A X C), (iii) A x (BnC ), (iv) (A x B) n (A X C). 

(i) First compute BuC = {2,3,4}. Then 

AX{BUC) = {{a, 2), {a, 3), (a, 4), (6,2), (6, 3), (6, 4)} 
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(ii) First find A X B and A X C: 

A X B = {(a, 2), (a, 3), (6,2), (6,3)} 

AXC = {(a, 3), (a, 4), (b, 3), (6, 4)} 

Then compute the union of the two sets: 

(AXB)U(AXC) = {(a, 2), (a, 3), (6,2), (6, 3), (a, 4), (6,4)} 

Observe from (i) and (ii) that 

A X (BuC) = (A X B) u (A X C) 

(iii) First compute BnC = {3}. Then 

Ax(BnC) = {(a, 3), (b, 3)> 

(iv) Now A X B and A X C were computed above. The intersection of A X B and A X C consists of 
those ordered pairs which belong to both sets: 

(A X B) n (A X C) = {(a, 3), (6, 3)} 

Observe from (iii) and (iv) that 

A x (BnC) = (A x B) n (A x C) 

1.20. Prove: Ax (BnC) = (Ax B) n (Ax C). 

AX (BnC) = {(x,y) : x G A, y G BnC} 

= {(X t y) : x G A, y G B, y G C} 

= {(x, t/) : (x, 2 /)GAx B, (x, i/JGAXC} 

- (A x B) n (A X C) 

1.21. Let S={a,&}, W = {1,2,3,4,5,6} and V = {3,5,7,9}. Find (S X W) n (S x F). 

The product set (S X XV) n (S X V) can be found by first computing S X W and S XV, and 
then computing the intersection of these sets. On the other hand, by the preceding problem, 
(SXW) n (SXV) = SX(WnV). Now WnV = {3,5}, and so 

(SXlf)n(3xy) = SX(WnV) = {(a, 3), (a, 5), (b, 3), (6, 5)} 

1.22. Prove: Let AcB and CcD ; then (A x C) C (B x D). 

Let (Xy y) be any arbitrary element in A X C; then x G A and y G C. By hypothesis, AcB and 

CcD; hence xGB and y G D. Accordingly (x, y) belongs to B X D. We have shown that 

(x,y) G A X C implies (x, y) G B X D\ hence (A X C) C (B X D). 

CLASSES OF SETS 

1.23. Consider the class A - {{2,3}, {4,5), {6}}. Which statements are incorrect and 
why? (i) {4,5) cA, (ii) {4,5} G A, (iii) {{4,5}}CA. 

The members of A are the sets {2, 3}, {4,5} and {6}. Therefore (ii) is correct but (i) is an 
incorrect statement. Moreover, (iii) is also a correct statement since the set consisting of the single 
element {4, 5} is a subclass of A. 

1.24. Find the power set C P(S ) of the set S = {1,2,3}. 

The power set <P(5) of <S is the class of all subsets of S; these are {1,2,3}, {1,2}, {1,3}, {2,3}, 
{1}, {2}, {3} and the empty set 0. Hence 

<P(S) = {S, {1,3}, {2,3}, {1,2}, {1}, {2}, {3}, 0} 


Note that there are 2 3 = 8 subsets of S. 




CHAP. 1] 


SET THEORY 


11 


1.25. Let X — {a,b,c,d,e,f,g}, and let: 

(i) Ai - {a, c, e), A 2 = {b}, As - {d, g}\ 

(ii) Bi = {a, e,g), B 2 - {c,d }, Bs = {b,e,f}\ 

(iii) Ci = { a,b,e,g }, C 2 = {c}, C 3 = {d,/}; 

(iv) Di = {a,b,c,d, e, f,g). 

Which of {Ai,A 2 ,As}, {Bi,B 2 ,B s }, {Ci,C 2 ,Cs}, {Di} are partitions of X? 

(i) {Aj, A 2 ,A 3 } is not a partition of X since / £! but f does not belong to either A 1( A 2 , or A 3 . 

(ii) B 3 ) is not a partition of X since e G X belongs to both B t and B 3 , 

(iii) {Ci, C 2 , C 3 } is a partition of X since each element in X belongs to exactly one cell, i.e. 
X = C 1 uC 2 uC 3 and the sets are pairwise disjoint. 

(iv) {.DJ is a partition of X. 

1.26. Find all the partitions of X = {a, b, c, d). 

Note first that each partition of X contains either 1, 2, 3, or 4 distinct sets. The partitions 
are as follows: 

(1) [{a, 6, c, d}] 

(2) [{a}, {b,c,d)) t [{b}, {a,c,d}], {{c}, {a,b,d}], [{d}, {a,b,c}], 

[{a,6}, {c,d}], [{a,c}, {&,«*}], [{a,d}, {b,c}] 

(3) [{a}, {6}, {c,d}], [{a}, {c}, {M>], [{a}, {d}, {6,c}], 

[{&},{«}, {a, d}], [{b}, {d}, {a,c)] t [{c}, {d}, {a, 6}] 

(4) [{a}, {6}, {c}, {d}] 

There are fifteen different partitions of X. 

1.27. Let N be the set of positive integers and, for each kGN, let 

A n = {x : x is a multiple of n} = {n, 2n, 3n, ...} 

Find (i) A 3 nAs, (ii) A i f\Ae„ (iii) UiepAj, where P is the set of prime numbers, 
2, 3, 5, 7, 11, ... . 

(i) Those numbers which are multiples of both 3 and 5 are the multiples of 15; hence A 3 nA 5 = A 15 . 

(ii) The multiples of 12 and no other numbers belong to both A 4 and A 6 ; hence A 4 nA 6 = A 12 . 

(iii) Every positive integer except 1 is a multiple of at least one prime number; hence 

U»€P A; = {2,3,4, ...} = N \ {1} 

1.28. Prove: Let {Ai: iGl} be an indexed class of sets and let i» G I. Then 

Ai G Aiq CI Uiei Ai 

Let fljgjAi; then x G A t for every iGl. In particular, x G A jo . Hence rijg/AjCA^. 
Now let y £ A i(i . Since i 0 GI, yG Ujgj A;. Hence A Sq C UA 4 . 

1.29. Prove (DeMorgan's law): For any indexed class { Ai'.iGl}, (UiAi) c = DiAt. 

(Ui Aj) c = {x : x & U ( A t } = {x : x £ A t for every i} — {x : x G Al for every i} = n 

1.30. Let oA be an algebra (o-algebra) of subsets of U. Show that: (i) U and 0 belong to 
oA ; and (ii) oA is closed under finite (countable) intersections. 

Recall that oA is closed under complements and finite (countable) unions. 

(i) Since cA is nonempty, there is a set A £ cA. Hence the complement A c £ cA, and the union 
U = A U A c £ cA. Also the complement 0 = U c £ cA. 

(ii) Let {A f } be a finite (countable) class of sets belonging to c A. By De Morgan’s law (Problem 1.29), 
(UjA?) c = HiA? 0 = n i A 1 . Hence n i A i belongs to c A, as required. 
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Supplementary Problems 

SETS, ELEMENTS, SUBSETS 

1.31. Write in set notation: 

(а) R is a subset of T. ( d) M is not a subset of 5. 

(б) x is a member of Y. (e) z does not belong to A. 

(c) The empty set. (/) R belongs to cA. 

1.32. Rewrite explicitly giving the elements in each set: 

(i) A — {x : x 2 — x —2 ~ 0 } 

(ii) B ~ {x : x is a letter in the word “follow”} 

(iii) C = {a? : x 2 = 9, x — 3 = 5} 

(iv) D = {x : x is a vowel} 

(v) E = {x : x is a digit in the number 2324} 

1.33. Let A = {1,2, ...,8,9}, B = {2,4,6,8}, C = {1,3,5,7,9}, D = {3,4,5} and E = {3,5}. 

Which sets can equal X if we are given the following information? 

(i) X and B are disjoint, (ii) XcD but X(j:B. (iii) XcA but X (f :C. (iv) Xc C but X <\_A. 

1.34. State whether each statement is true or false: 

(i) {1,4,3} = {3,4,1} (iii) 1 (£ {1,2} (v) {4}C{{4}} 

(ii) {3,1, 2} c {1,2, 3} (iv) {4}G{{4}} (vi) 0c{{4}} 

1.35. Let A — {1,0}. State whether or not each statement is correct: 

(i) {0}GA, (ii) 0 G A, (iii) {0} c A, (iv) OGA, (v) OcA. 

1.36. State whether each set is finite or infinite: 

(i) The set of lines parallel to the x axis. 

(ii) The set of letters in the English alphabet. 

(iii) The set of numbers which are multiples of 5. 

(iv) The set of animals living on the earth. 

(v) The set of numbers which are solutions of the equation x 27 + 26a; 18 — 17a; 11 + 7a; 3 — 10 = 0. 

(vi) The set of circles through the origin (0,0). 

SET OPERATIONS 

1.37. Let U = {a f b,c t d,ej t g}, A = {a,b,c,d,e}, B = {a,c,e,g} and C = {b, e f /, g}. Find: 

(i) A u C (iii) C\B (v) C c n A (vii) (A \R c ) c 

(ii) Bn A (iv) B C UC (vi) (A \ C) c (viii) (A n A c ) c 

1.38. In the Venn diagrams below, shade (i) W\V, (ii) V c U W, (iii) V n W c , (iv) V C \TF C . 



(a) (b) 

1.39. Prove: (a) A U B = (Ac n B c ) c ; (b) A \ B — A n B c . (Thus the union and difference operations can 
be defined in terms of the operations of intersection and complement.) 

1.40. Prove Theorem 1.3(ii): A c B if and only if A U B = B. 

1.41. Prove: If A n B = 0, then A C B c . 

1.42. Prove: A C \B C = B\A. 
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1.43. Prove: A cB implies A U (B\A) - B. 

1.44. (i) Prove: An(B\C) = (AnB)\(AnC). 

(ii) Give an example to show that A u (B \ C) -A (A u B) \ (A u Q. 


PRODUCT SETS 

1.45. Let W = {Mark, Eric, Paul) and let V = (Eric, David}. Find: 
(i) W XV, (ii) V X W, (iii) V 2 - V x V. 


1.46. 


Let A = {2, 3}, B = {1,3, 5} and C = (3,4}. 
find A X B X C. (See Problem 1.18.) 


Construct the “tree diagram” of AXBxC and then 


1.47. 


Let S - {a, b, c}, T - {b, c, d} and W = {a, d }. 
then find S X T X W. 


Construct the tree diagram of SxJ’xlf and 


1.48. 


Suppose that the sets V, W and Z have 3, 4 
of elements in (i) V X W X Z, (ii) ZXVXW, 


and 5 elements respectively, 
(iii) WX Z XV. 


Determine the number 


1.49. Let A - BnC. Determine if either statement is true: 

(i) A X A = (B X B) n (C X C), (ii) A X A = (B X Q n (C X B). 


1.50. Prove: A X (BuC) = (A X B) U (A X C). 


CLASSES OF SETS 


1.51. 


Let A n - {x : x is a multiple of n} — {w, 2n, 3 n, .. 
(i)A 2 nA 7 ; (ii) A 6 n A 8 ; (iii) A 3 UA 12 ; (iv) A 3 nA 12 ; 
where s,te N. (vii) Prove: If J c N is infinite, then 


where n £ N, the positive integers. Find: 
(v) A s U A st , where s,tG N; (vi )A s nA st> 
^iej A{ — 0. 


1.52. 

1.53. 


1.54. 

1.55. 


Find the power set <P(A) of A = {1,2,3,4} and the power set q>(B) of 5= {1, {2,3}, 4}. 

Let W - {1,2,3,4, 5,6}. Determine whether each of the following is a partition of W: 

(i) [{1, 3, 5}, {2,4}, {3, 6}] (iii) [{i, 5}, {2}, {4}, {3 f g}] 

(ii) [{1, 5}, {2}, {3, 6}] (iv) [{1,2,3,4, 5,6}] 

Find all partitions of 7= {1, 2, 3}. 

Let [A 1( A 2 , ...,A m ] and [B U B 2 , ...,B n ] be partitions of a set X. Show that the collection of sets 

l A i 0 B 3 : * = 1, ..w, j = 1, .. n] 
is also a partition (called the cross partition) of X. 


1.56. Prove: For any indexed class {A*: iel} and any set B , 

(a) Pu(n i A i ) = OiCBuAi), (6) Bn^A*) = u^BoA*). 

1.57. Prove (De Morgan's law): (f^A^c = u i A c i . 

1.58. Show that each of the following- is an algebra of subsets of U: 

(i) oA = {0, U}; (ii) <B = {0, A, Ac, U}; (iii) cp(XJ ), the power set of 17. 


1.59. 


^ ^- al & ebra s) °f subsets of U. Prove that the intersection c4 n«i s also 

an algebra (a-algebra) of subsets of U. 
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Answers to Supplementary Problems 

1.31. (a) RcT, (b) xG Y, (c) 0, (d) M£S, (e) z G A, (/) R G oA. 


1.32. (i) A = {—1,2}, (ii) B = { f,o,l,w }, (iii) C = 0, (iv) D — {a,e,i,o,u}, (v) E — {2,3,4}. 

1.33. (i) C and E, (ii) D and E, (iii) A, B and D, (iv) none. 


1.34. All the statements are true except (v). 

1.35. (i) incorrect, (ii) incorrect, (iii) correct, (iv) correct, (v) incorrect. 

1.36. (i) infinite, (ii) finite, (iii) infinite, (iv) finite, (v) finite, (vi) infinite. 


1.37. (i) AuC=U 

(ii) Bn A = {a,o, e} 

(iii) C\B = {b,f} 

(iv) B c uC = {b,d,e,f,g} 


(v) C c nA = {a,c,d} = C c 

(vi) {A \ C) c = {b,e,f,g} 

(vii) (A \ £0° = {b, d, f, g) 
(viii) (AnA c ) c = U 








Observe that V c U W = U and V n W c — 0 in case (6) where V C W. 




y c \ w c 


1.45. 

(i) 

WxV = 


(ii) 

VxW = 


(iii) 

VxV = 

1.46. 




{(Mark, Eric), (Mark, David), (Eric, Eric), (Eric, David), (Paul, Eric), (Paul, David)} 
{(Eric, Mark), (David, Mark), (Eric, Eric), (David, Eric), (Eric, Paul), (David, Paul)} 
{(Eric, Eric), (Eric, David), (David, Eric), (David, David)} 



(2, 1, 3) 
(2, 1, 4) 

(2, 3, 3) 
(2, 3, 4) 

(2, 5, 3) 
(2, 5, 4) 

(3, 1, 3) 
(3, 1, 4) 

(3, 3, 3) 
(3, 3, 4) 

(3, 5, 3) 
(3, 5, 4) 


The elements of A X B X C are the ordered triplets to the right of the tree diagram above. 
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a 

(a, b, a) 

d 

(a, b, d) 

a 

(a, c, a) 

d 

(a, c, d) 

a 

(a, d, a) 

d 

(a, d, d) 

a 

(b, b, a) 

d 

(6, b, d) 

a 

(b, c, a) 

d 

(b, c, d) 

a 

( b, d, a) 

d 

(b, d, d) 

a 

(c, b, a) 

d 

(c, b, d) 

a 

(c, e, a) 

d 

(c, c, d) 

a 

(c, d, a) 

d 

(c, d, d) 


The elements of S X T X W are the ordered triplets listed to the right of the tree diagram. 


1.48. Each has 60 elements. 

1.49. Both are true: A X A = (B X B) n (C X C) = (B X C) n {C X B). 

1.51. (i) A u , (ii) A24, (iii) A 3 , (iv) A 12 , (v) A s , (vi) A st . 

1.52. <P(B) = { B , {1, {2,3}}, {1,4}, {{2,3}, 4}, {1}, {{2,3}}, {4}, 0}. 


1.53. (i) no, (ii) no, (iii) yes, (iv) yes. 
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1.54. [{1,2,3}], [{1}, {2, 3}], [{2}, {1,3}], [{3}, {1,2}] and [{1}, {2}, {3}] 




Chapter 2 


Techniques of Counting 


INTRODUCTION 

In this chapter we develop some techniques for determining without direct enumeration 
the number of possible outcomes of a particular experiment or the number of elements in 
a particular set. Such techniques are sometimes referred to as combinatorial analysis. 

FUNDAMENTAL PRINCIPLE OF COUNTING 

We begin with the following basic principle. 

Fundamental Principle of Counting: If some procedure can be performed in fti dif¬ 
ferent ways, and if, following this procedure, a second procedure can be 
performed in n 2 different ways, and if, following this second procedure, a 
third procedure can be performed in different ways, and so forth; then the 
number of ways the procedures can be performed in the order indicated is the 
product ni • rt 2 m ns ... . 

Example 2.1: Suppose a license plate contains two distinct letters followed by three digits with 

the first digit not zero. How many different license plates can be printed? 

The first letter can be printed in 26 different ways, the second letter in 25 dif¬ 
ferent ways (since the letter printed first cannot be chosen for the second letter), 
the first digit in 9 ways and each of the other two digits in 10 ways. Hence 

26* 25-9- 10 * 10 = 585,000 
different plates can be printed. 


FACTORIAL NOTATION 

The product of the positive integers from 1 to n inclusive occurs very often in mathe¬ 
matics and hence is denoted by the special symbol n! (read “n factorial”): 

nl = 1 • 2 • 3.(« — 2 )(n — 1 )n 

It is also convenient to define 0! = 1. 


Example 2.2: 


Example 2.3: 


2! = 1*2 = 2, 3! =1*2*3 = 6, 4! = 1 • 2 • 3 • 4 = 24, 

5! = 5*4! = 5* 24 = 120, 6! = 6 • 5! = 6 • 120 = 720 


S’ 8 • 7 • 6 T 

£7 = * * = 8* 7 = 56 

6! 6! 


12 * 11*10 


12*11*10*9! _ 12] 
9! 9! 


PERMUTATIONS 

An arrangement of a set of 71 objects in a given order is called a permutation of the 
objects (taken all at a time). An arrangement of any r^n of these objects in a given 
order is called an r-jiermutation or a permutation of the n objects taken r at a time. 

Example 2.4: Consider the set of letters a, b, c and d. Then: 

(i) bdca, dcba and acdb are permutations of the 4 letters (taken all at a time); 

(ii) bad, adb, cbd and bca are permutations of the 4 letters taken 3 at a time; 

(iii) ad, cb, da and bd are permutations of the 4 letters taken 2 at a time. 
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The number of permutations of n objects taken r at a time will be denoted by 

P(n, r) 

Before we derive the general formula for P(n, r) we consider a special case. 


Example 2.5: Find the number of permutations of 6 objects, say a, b, c, d, e, /, taken three at a 

time. In other words, find the number of “three letter words” with distinct letters 
that can be formed from the above six letters. 


Let the general three letter word be represented by three boxes: 


Now the first letter can be chosen in 6 different ways; following this, the second 
letter can be chosen in 5 different ways; and, following this, the last letter can be 
chosen in 4 different ways. Write each number in its appropriate box as follows: 



Thus by the fundamental principle of counting there are 6*5*4 — 120 possible 
three letter words without repetitions from the six letters, or there are 120 permu¬ 
tations of 6 objects taken 3 at a time. That is, 


P(6,3) - 120 


The derivation of the formula for P(n , r) follows the procedure in the preceding example. 
The first element in an r-permutation of n-objeets can be chosen in n different ways; follow¬ 
ing this, the second element in the permutation can be chosen in n — 1 ways; and, following 
this, the third element in the permutation can be chosen in n — 2 ways. Continuing in this 
manner, we have that the rth (last) element in the r-permutation can be chosen in n — (r—1) = 
n — r + 1 ways. Thus 


7 ) ! 

Theorem 2.1: P(n,r ) = n(n — \)(n — 2) • • • (n — r + 1) = -' 

The second part of the formula follows from the fact that 


»(n —1)(« — 2) • • • (n — r +1) = n(n - 1)(n ~ 2 > - r > ! 

In the special case that r = n, we have 

P(n,ri) — n(n — l)(n — 2)* * -3*2*1 = n! 

Namely, 


n ! 


(n — r)! 


Corollary 2.2: There are n \ permutations of n objects (taken all at a time). 

Example 2.6: How many permutations are there of 3 objects, say, a, b and c? 

By the above corollary there are 3! = 1 • 2 • 3 = 6 such permutations. These 
are abc, acb, bac , bca, cab , cba . 


PERMUTATIONS WITH REPETITIONS 

Frequently we want to know the number of permutations of objects some of which are 
alike, as illustrated below. The general formula follows. 

Theorem 2.3: The number of permutations of n objects of which n\ are alike, riz are 
alike, ..., n T are alike is 

n\ 


n \! n 2 ! • * • n r ! 
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We indicate the proof of the above theorem by a particular example. Suppose we want 
to form all possible 5 letter words using the letters from the word DADDY. Now there 
are 5 != 120 permutations of the objects D lf A, D 2f D 3i Y where the three D ’s are dis¬ 
tinguished. Observe that the following six permutations 

D 1 D 2 DsAY f D 2 D 1 DsAY, D 3 D 1 D 2 AY, DiD 3 D 2 AY, D 2 D 3 DiAY, D 3 D 2 DiAY 

produce the same word when the subscripts are removed. The 6 comes from the fact that 
there are 31 = 8*2*1 = 6 different ways of placing the three D *s in the first three posi¬ 
tions in the permutation. This is true for each of the other possible positions in which the 
D f s appear. Accordingly there are 


different 5 letter words that can be formed using the letters from the word DADDY. 


Example 2.7: How many different signals, each consisting of 8 flags hung in a vertical line, 

can be formed from a set of 4 indistinguishable red flags, 3 indistinguishable white 
flags, and a blue flag? We seek the number of permutations of 8 objects of which 
4 are alike (the red flags) and 3 are alike (the white flags). By the above theorem, 
there are 

8! = 8-7-6-5-4-3-2-1 = 2g 

4!3! 4*3*2*1*3*2*1 

different signals. 


ORDERED SAMPLES 

Many problems in combinatorial analysis and, in particular, probability are concerned 
with choosing a ball from an urn containing n balls (or a card from a deck, or a person 
from a population). When we choose one ball after another from the urn, say r times, we 
call the choice an ordered sample of size r. We consider two cases: 

(i) Sampling with replacement . Here the ball is replaced in the urn before the next ball 
is chosen. Now since there are n different ways to choose each ball, there are by the 
fundamental principle of counting 

r times 

r \ 

n* n* n * * ' n = n T 
different ordered samples with replacement of size r. 

(ii) Sampling without replacement. Here the ball is not replaced in the urn before the next 
ball is chosen. Thus there are no repetitions in the ordered sample. In other words, 
an ordered sample of size r without replacement is simply an r-permutation of the 
objects in the urn. Thus there are 

n ! 

P(n,r) = n(n — l)(n — 2) •••(«. — r +1) = ^ _ r y 
different ordered samples of size r without replacement from a population of n objects. 


Example 2.8: In how many ways can one choose three cards in succession from a deck of 52 cards 

(i) with replacement, (ii) without replacement? If each card is replaced in the 
deck before the next card is chosen, then each card can be chosen in 52 different 
ways. Hence there are 

52 *52-52 = 52 3 = 140,608 
different ordered samples of size 3 with replacement. 
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On the other hand if there is no replacement, then the first card can be chosen 
in 52 different ways, the second card in 51 different ways, and the third and last 
card in 50 different ways. Thus there are 

52 * 51 - 50 = 132,600 

different ordered samples of size 3 without replacement. 


BINOMIAL COEFFICIENTS AND THEOREM 


The symbol ( r ) > read “ nCr”, where r and n are positive integers with r — », is defined 

as follows: ' ' v , 

(n\ _ n(n — l)(ft — 2) • • • (n — r + 1) 

W ~ 1 • 2 • 3 ■ • • (r — l)r 

These numbers are called the binomial coefficients in view of Theorem 2.5 below. 


Example 2.9: 


8\ 8*7 


9 • 8 ♦ 7 • 6 
1 • 2 • 3 • 4 


12 • 11* 10 • 9 • 8 
1 • 2 • 3 • 4 • 5 


Observe that y r j has exactly r factors in both the numerator and denominator. Also, 

/ n\ _ ft(ft — 1) • • - (ft — r + 1) _ ft(ft — 1) •••(« — r 4- l)(ft — r )! _ ft! 

\r) ~ 1-2-3-• -(r-l)r ~ 1-2-3-•-(r-l)r(ft-r)! ~ rl(n-r)\ 

Using this formula and the fact that n — (n — r) = r, we obtain the following important 
relation. 


Lemma 2.4: 


or, in other words, if a + b = n then 


Example 2.10: 


10\ _ 10-9*8*7*6*5*4 
7 / 1*2*3*4*5*6*7 


= 120 or 


10 \ = 10*9*8 
3 / “ 1*2*3 


= 120 


Note that the second method saves both space and time. 


Remark: Motivated by the second formula for and the fact that 0! = 1, we define: 

(o) = oih = 1 and - in particular, Q = = 1 

The Binomial Theorem, which is proved (Problem 2.18) by mathematical induction, gives 
the general expression for the expansion of (a + 6)”. 


Theorem 2.5 (Binomial Theorem): 

(a + b)” = 2 (”) a”~ r b r 

= a” + na”~ 1 b + 0 ^ a"~ 2 b 2 + • • • + nab n_1 + b” 

1 * 2 

Example 2.11: (a + 6) 5 = o 5 + 5a 4 b + a 3 6 2 + ^ a 2 b 3 + 5ab 4 + 6 5 

= a 5 + 5a 4 b + 10a 3 6 2 + 10a 2 b 3 + 5ab 4 + b 5 

= a 3 + 6a 5 6 + a 4 b 2 + f- L | 1 l“ 3 6 3 + + 6ab 3 + 6 8 

1 • 2 1*2*3 1*2 

= a 6 + 6a*b + 15a 4 6 2 + 20a 3 6 3 + 15a 2 b 4 + 6ab* + 6« 


(a + 6) 6 
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The following properties of the expansion of ( a + b) n should be observed: 

(i) There are n + 1 terms. 

(ii) The sum of the exponents of a and b in each term is n. 

(iii) The exponents of a decrease term by term from n to 0; the exponents of b increase 
term by term from 0 to n. 

(iv) The coefficient of any term is K j where It is the exponent of either a or 6. (This fol¬ 
lows from Lemma 2.4.) ' ' 

(v) The coefficients of terms equidistant from the ends are equal. 

We remark that the coefficients of the successive powers of a + b can be arranged in a 
triangular array of numbers, called Pascal’s triangle, as follows: 


(a +6)o = l 

(o + 6) 1 — a + b 

(a + 6) 2 = a* -h 2 ab + 62 

(a+ 6)3 = a s + 3a 2 6 + 3a6 2 + 63 

(a + 6) 4 = a 4 + 4a 2 6 + 6a 2 6 2 + 4a6 3 + 6 4 

(a + 6)5 = a 5 + 5a 4 6 + 10 a 3 6 2 + 10a 2 63 + 5a6 4 + 65 

(a + 6) 6 ” a 6 + 6a 5 6 + 15a 4 6 2 + 20a 3 6 3 + 15a 2 6 4 + 6a6 5 + 6 6 



Pascal’s triangle has the following interesting properties. 

(a) The first number and the last number in each row is 1. 

(b) Every other number in the array can be obtained by adding the two numbers appearing 
directly above it. For example: 10 = 4 + 6, 15 = 5 + 10, 20 = 10 + 10. 

We note that property (6) above is equivalent to the following theorem about binomial 
coefficients. 


Theorem 2.6: 


n + 1 


r — 1 


Now let n\y n 2 , ..., n r be nonnegative integers such that «i + n 2 + * * • + n r = n. Then 


the expression 


For example, 

( 7 

V 2, 3, 2 


ni, n 2 , . .., n r 


is defined as follows: 




_ nl _ 

ni ! n 2 ! • • • n r ! 


2 ! 3 ! 2! 


= 210 


4, 2, 2, 0 


4 ! 2 ! 2 ! 0 ! 


= 420 


These numbers are called the multinomial coefficients in view of the following theorem 
which generalizes the binomial theorem. 



Theorem 2.7: (at + a 2 +- • • * + a r ) 
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COMBINATIONS 

Suppose we have a collection of n objects. A combination of these n objects taken r at 
a time , or an recombination , is any subset of r elements. In other words, an r-combination 
is any selection of r of the n objects where order does not count. 

Example 2.12: The combinations of the letters a,b,c,d taken 3 at a time are 

{a, b, c}, {a, by d}, {a, c, d}, {b, c, d} or simply abc, abd, acd f bed 
Observe that the following combinations are equal: 

abc, acb, bac, bca, cab, eba 
That is, each denotes the same set {a, b, c }. 

The number of combinations of n objects taken r at a time will be denoted by 

C(n, r) 

Before we give the general formula for C(n , r), we consider a special case. 


Example 2.13: We determine the number of combinations of the four letters a,b,c,d taken 3 at a 
time. Note that each combination consisting of three letters determines 3! = 6 
permutations of the letters in the combination: 


Combinations 

Permutations 

abc 

abc, acb, bac, bca, cab, eba 

abd 

abd, adby bad, bda, dab, dba 

acd 

acd, adc, cad, eda, dac, dca 

bed 

bed, bde, cbd, cdb, dbc, deb 


Thus the number of combinations multiplied by 3! equals the number of permu¬ 
tations: 

C(4,3) • 3! = P(4,3) or C(4,3) = 

Now P(4, 3) = 4 • 3 • 2 = 24 and 3! = 6; hence C(4, 3) = 4 as noted above. 


Since each combination of n objects taken r at a time determines r! permutations of the 
objects, we can conclude that 

P(n, r) = rl C(n, r) 

Thus we obtain 

Theorem 2.8: C(n, r) = EiZk li — — —— 

r ! r ! (n — r )! 

Recall that the binomial coefficient ( J was defined to be .. — — 77 : hence 

\r J r ! (n — r)! 


C(n,r) = (^j 


We shall use C(n, r) and ( j interchangeably. 
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Example 2.14: How many committees of 3 can be formed from 8 people? Each committee is 
essentially a combination of the 8 people taken 3 at a time. Thus 


C( 8,3) = 



different committees can be formed. 


8*7*6 

1*2*3 


56 


ORDERED PARTITIONS 

Suppose an urn A contains seven marbles numbered 1 through 7, We compute the num¬ 
ber of ways we can draw, first, 2 marbles from the urn, then 3 marbles from the urn, and 
lastly 2 marbles from the urn. In other words, we want to compute the number of ordered 
partitions 

(Ai, A 2 , A3) 


of the set of 7 marbles into cells Ai containing 2 marbles, A 2 containing 3 marbles and A 3 
containing 2 marbles. We call these ordered partitions since we distinguish between 


({1,2} {3,4,5}, {6,7}) and ({6,7}, {3,4,5}, {1,2}) 
each of which yields the same partition of A. 


Since we begin with 7 marbles in the urn, there are wa ys °f drawing the first 2 
marbles, i.e. of determining the first cell Ax', following this, there are 5 marbles left in the 
urn and so there are (^ j ways of drawing the 3 marbles, i.e. of determining A 2 ; finally, 

W /2\ 

there are 2 marbles left in the urn and so there are ( 2 ) wa y s of determining the last cell 
A3. Thus there are ^ / 


/7\/5\/2\ _ 7*6 5*4*3 2*1 _ 

V2 /V 3 /\ 2/ “ 1 * 2 * 1 • 2 • 3 * 1 • 2 — 

different ordered partitions of A into cells Ai containing 2 marbles, A 2 containing 3 marbles, 
and A 3 containing 2 marbles. 


Now observe that 

7! 5! 2! 7! 

V 2 ) V3/\2 ) ~ 2 !5!* 3!2 !* 2!0! = 2!3!2! 

since each numerator after the first is cancelled by the second term in the denominator of 
the previous factor. In a similar manner we prove (Problem 2.28) 


Theorem 2.9: Let A contain n elements and let ni, n 2 , ... ,n r be positive integers with 
ni + n 2 + • • • 4 -n r ~n. Then there exist 

_ nl _ 

n \! n 2 ! nz ! • • • Ur ! 

different ordered partitions of A of the form (Ai, A 2 , .. ., A r ) where At con¬ 
tains ni elements, A 2 contains n 2 elements, ..., and A r contains nr elements. 

Example 2.15: In how many ways can 9 toys be divided between 4 children if the youngest child 
is to receive 3 toys and each of the other children 2 toys? 

We wish to find the number of ordered partitions of the 9 toys into 4 cells 
containing 3, 2, 2 and 2 toys respectively. By the above theorem, there are 

9! 

3! 2! 2! 2! “ 7560 

such ordered partitions. 



CHAP. 2] 


TECHNIQUES OF COUNTING 


23 


TREE DIAGRAMS 

A tree diagram is a device used to enumerate all the possible outcomes of a sequence of 
experiments where each experiment can occur in a finite number of ways. The construction 
of tree diagrams is illustrated in the following examples. 

Example 2.16: Find the product set A X B X C where A = {1,2}, B — {a, b, c} and C = {3,4). 
The tree diagram follows: 

(1, o. 3) 

(1, <». 4) 

a, b, 3) 

(1, b, 4) 

a,3) 

(1. e, 4) 

(2. a, 3) 

(2, a. 4) 

(2. b, 3) 

(2, b, 4) 

(2, e, 3) 

(2. c, 4) 

Observe that the tree is constructed from left to right, and that the number of 
branches at each point corresponds to the number of possible outcomes of the next 
experiment. 



Example 2.17: Mark and Eric are to play a tennis tournament. The first person to win two games 
in a row or who wins a total of three games wins the tournament. The following 
diagram shows the possible outcomes of the tournament. 



Observe that there are 10 endpoints which correspond to the 10 possible out¬ 
comes of the tournament: 

MM, MEMM, MEMEM, MEMEE, MEE, EMM, EMEMM, EMEME, EMEE, EE 

The path from the beginning of the tree to the endpoint indicates who won which 
game in the individual tournament. 
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Solved Problems 


FACTORIAL 

2.1. Compute 4!, 5!, 6!, 7! and 8!. 

4! = 1 *2 • 3 -4 = 24 7! = 7 • 6! = 7 • 720 = 5040 

5! = 1*2*3*4*5 = 5*4! = 5*24 = 120 8! = 8*7! = 8-5040 = 40,320 

6! = 1 • 2 • 3 • 4 • 5 • 6 = 6-5! = 6- 120 = 720 


2 . 2 . 


1Q| 7! 

Compute: (i) yyj, (ii) JoT* 


(i) 


13! 

11 ! 


13! 

11 ! 


13 * 12 * 11 * 10 * 9 * 8 * 7 * 6 * 5 * 4 * 3 * 2 * 1 
11*10*9*8*7*6*5*4*3*2*1 
13*12*11! 


11 ! 


7’ 

(ii) wj = 


71 


10 • 9 • 8 • 7! 10*9*8 


13*12 = 156 


1 

720 


= 13*12 = 156 


2.3. Simplify: (i) (ii) 


(i) 


(ii) 


nl _ n(n — 1 )(n — 2)* • • 3 * 2 * 1 
(n-1)! - (n — l)(n — 2)• • • 3 • 2 • 1 


+ 2 )! 


or, simply, 


n or, simply, 

jn -r 2)(n -f l)?i(n — l)(ti — 2) ■ * * 3 * 2 * 1 __ 


nl _ n(n —1)! 


n(n — l)(n — 2) • • • 3 * 2 • 1 

(n + 2)! 
nl 


(n — 1)! (n~l)! 

(7i + 2)(n + l) = % 2 + 3?i + 2 


= {n + 2){n + 1 ) = TO 2 + 3w+2 


W! 


PERMUTATIONS, ORDERED SAMPLES 

2.4. If repetitions are not permitted, (i) how many 3 digit numbers can be formed from 
the six digits 2, 3, 5, 6, 7 and 9 ? (ii) How many of these are less than 400 ? (iii) How 
many are even? (iv) How many are odd? (v) How many are multiples of 5? 


In each case draw three boxes 


□□□ 


to represent an arbitrary number, and then 


write in each box the number of digits that can be placed there. 


(i) The box on the left can be filled in 6 ways; following this, the middle box can be filled in 

5 ways; and, lastly, the box on the right can be filled in 4 ways: ^ 5 4. Thus there 

are 6*5*4 = 120 numbers. - —-— ' -* 


(ii) The box on the left can be filled in only 2 ways, by 2 or 3, since each number must be less 
than 400; the middle box can be filled in 5 ways; and, lastly, the box on the right can be filled 

in 4 ways: J 2 j j~ 5 j |~4 | . Thus there are 2*5*4 = 40 numbers. 

(iii) The box on the right can be filled in only 2 ways, by 2 or 6, since the numbers must be even; 

the box on the left can then be filled in 5 ways; and, lastly, the middle box can be filled in 

4 ways: 000- Thus there are 5*4*2 = 40 numbers. 

(iv) The box on the right can be filled in only 4 ways, by 3, 5, 7 or 9, since the numbers must be 

odd; the box on the left can then be filled in 5 ways; and, lastly, the box in the middle can 

be filled in 4 ways: 5 4 4 . Thus there are 5*4*4 = 80 numbers. 
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(v) The box on the right can be filled in only 1 way, by 5, since the numbers must be multiples 
of 5; the box on the left can then be filled in 5 ways; and, lastly, the box in the middle can 

be filled in 4 ways: |~5 | | 4 j |~ 1 | . Thus there are 5*4*1 = 20 numbers. 


2.5. In how many ways can a party of 7 persons arrange themselves (i) in a row of 
7 chairs? (ii) around a circular table? 

(i) The seven persons can arrange themselves in a row in 7*6*5*4*3*2*1 = 7! ways. 

(ii) One person can sit at any place in the circular table. The other six persons can then arrange 
themselves in 6*5*4*3*2*1 = 6! ways around the table. 

This is an example of a circular permutation . In general, n objects can be arranged in 
a circle in (n — 1 )(n — 2) • • • 3 • 2 • 1 = (n — 1)! ways. 


2.6. (i) In how many ways can 3 boys and 2 girls sit in a row? (ii) In how many ways 

can they sit in a row if the boys and girls are each to sit together? (iii) In how 
many ways can they sit in a row if just the girls are to sit together? 

(i) The five persons can sit in a row in 5 • 4 • 3 • 2 • 1 =5! — 120 ways, 

(ii) There are 2 ways to distribute them according to sex: BBBGG or GGBBB. In each case the 
boys can sit in 3 * 2 * 1 = 3! = 6 ways, and the girls can sit in 2 * 1 = 2! = 2 ways. Thus, 
altogether, there are 2 • 3! • 2! = 2*6*2 = 24 ways. 

(iii) There are 4 ways to distribute them according to sex: GGBBB, BGGBB, BBGGB, BBBGG. 
Note that each way corresponds to the number, 0, 1, 2 or 3, of boys sitting to the left of the 
girls. In each case, the boys can sit in 3! ways, and the girls in 2! ways. Thus, altogether, 
there are 4*3!*2! — 4*6*2 = 48 ways. 


2.7. How many different signals, each consisting of 6 flags hung in a vertical line, can 
be formed from 4 identical red flags and 2 identical blue flags? 

6 ! 

This problem concerns permutations with repetitions. There are y =15 signals since 
there are 6 flags of which 4 are red and 2 are blue. 


2.8. How many distinct permutations can be formed from all the letters of each word: 
(i) them, (ii) unusual, (iii) sociological? 

(i) 4! = 24, since there are 4 letters and no repetitions. 

7T 

(ii) = 840, since there are 7 letters of which 3 are u. 

121 

(iii) - "i 2 ~ | 2 ) * s i nce there are 12 letters of which 3 are o, 2 are c, 2 are i, and 2 are L 


2.9. (i) In how many ways can 3 Americans, 4 Frenchmen, 4 Danes and 2 Italians be 

seated in a row so that those of the same nationality sit together? 

(ii) Solve the same problem if they sit at a round table. 

(i) The 4 nationalities can be arranged in a row in 4! ways. In each case the 3 Americans can 
be seated in 3! ways, the 4 Frenchmen in 4! ways, the 4 Danes in 4! ways, and the 2 Italians 
in 2! ways. Thus, altogether, there are 4! 3! 4! 4! 2! = 165,888 arrangements. 

(ii) The 4 nationalities can be arranged in a circle in 3! ways (see Problem 14.4 on circular 
permutations). In each case the 3 Americans can be seated in 3! ways, the 4 Frenchmen in 
4! ways, the 4 Danes in 4! ways, and the 2 Italians in 2! ways. Thus, altogether, there are 
3! 3! 4! 4! 2! = 41,472 arrangements. 
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2.10. Suppose an urn contains 8 balls. Find the number of ordered samples of size 3 
(i) with replacement, (ii) without replacement. 

(i) Each ball in the ordered sample can be chosen in 8 ways; hence there are 8 * 8 * 8 ~ 8 3 = 512 
samples with replacement. 

(ii) The first ball in the ordered sample can be chosen in 8 ways, the next in 7 ways, and the last 
in 6 ways. Thus there are 8*7*6 = 836 samples without replacement. 

2.11. Find n if (i) P(n, 2) = 72, (ii) P(n, 4) = 42P(n, 2), (iii) 2 P{n, 2) + 50 = P{2n, 2). 

(i) P(n, 2) = n(n — 1) = n 2 — n; hence n 2 — n = 72 or n 2 — n —72 = 0 or (n — 9)(n + 8) = 0. 

Since n must be positive, the only answer is n = 9. 

(ii) P(n, 4) = n(n — l)(n — 2)(n — 3) and Pin, 2) = n(n — 1). Hence 

n(n — l)(n — 2)(n — 3) = 42rt(n— 1) or, if n ¥* 0, 9* 1, (n—2)(n — 3) = 42 

or n 2 — 5n + 6 = 42 or n 2 — bn — 36 = 0 or {n — 9)(n + 4) = 0 
Since n must be positive, the only answer is n — 9. 

(iii) P(n, 2) = n(n — 1) = n 2 — n and P(2n f 2) = 2n(2n — 1) = 4w 2 — 2w. Hence 

2(n 2 — n) 4- 50 = 4n 2 — 2n or 2n 2 ~ 2n + 50 = 4n 2 — 2 n or 50 = 2 n 2 or n 2 = 25 

Since n must be positive, the only answer is n — 5. 


BINOMIAL COEFFICIENTS AND THEOREM 

2.12. Compute: (i) (g 6 ), (ii) (i») 

Recall that there are as many factors in the numerator as in the denominator. 



16 • 15 • 14 
1*2*3 


560 



15*14*13*12*11 
1 • 2 • 3 * 4 * 5 


3003 



12*11*10*9 
1 • 2 • 3 • 4 
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2.15. Expand and simplify: (x 2 — 2 1 /) 6 . 


(x 2 — 2j/) 6 = (*2)6 + i(*i)B ( _ 2 „) + |^|(#*)4(-2y)* + f^|(**)«(-2»)« 

+ f ||(* 2 ) 2 (- 22/) 4 + | (* 2 )(- 2 v ) 5 + (— 2 y ) 6 
= # 12 — 12x 10 t/ + 60x 8 t/ 2 — 160# 6 i/ 3 4* 240# 4 t/ 4 — 192x 2 i/ 5 4 64y 6 


2.16. Prove: 2 4 = 16 = 


0 + [1 


+ {3 + U 


Expand (1 + l) 4 using the binomial theorem: 


a+i). = + (j)i*i. + ( 4 )i«i« + (‘)i.i> + (:)i. 

- G)♦ C)♦ G ) 4 ( 1 )♦ C) 


2.17. Prove Theorem 2.6: 


n 4* T 


Now - ) + ) = 7 - 7 T"»—r—- tttt + “?—zr 1 —n. To obtain the same denomi- 

\r —1/ \r/ (r — 1)! • (n — r 4 1)! rl'(n — r)l 

T 71 — r 4- 1 

nator in both fractions, multiply the first fraction by — and the second fraction by . Hence 

/ n \ i f ’A „ _ r • n\ _ (n — r + 1 ) * n\ 

yr — lj \r) r • (r — 1)! • (n — r 4 1)! r ! • (n — r 4 1) • (n — r)! 

r • n ! (n — r 4 1) • n ! 

~ r!(?i — r + 1 )! r!(n — r+ 1 )! 

r»w! + (w-r+ 1 ) •»! _ [r+ (w-r+ 1 )] »w! 

r! (n — r + 1 )! r!(n —r + 1 )! 

(n+ 1 )«! _ (« + !)! _ /»+A 

rl(n — r 4 1 )! r!(?i —r + 1 )! V r J 


2.18. Prove the Binomial Theorem 2.5: (a + &) n = 2 a n ~ r b r . 

The theorem is true for n = 1, since 

2 ^a 1 ' r 6 r = ^ a 1 6° + a° b 1 = a+b = (a+b) 1 

We assume the theorem holds for (a 4- 6) n and prove it is true for (a + 6) n+1 . 

(a+6) n+1 = (a+6)(a+6) n 

= (a+b) a n + ^”^a n - 1 b + ••• + ^ - i) a "~ r+1 6 ' -1 

+ (^ja n ~ r b r + ••• + ^jab»-> + b n 

Now the term in the product which contains b r is obtained from 

b ^ % i) a "~ r+1 &f ~ 1 J + a (^ l ^j an ~ rbr = ^ U 1 ^a rt ~ r+1 & r + (^^a” _r+1 & r 

= [(.:,) + (:)]«-'*■»' 
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But, by Theorem 2.6, 


, Thus the term containing b r is 


Note that (a + b)(a + b) n is a polynomial of degree n + 1 in b. Consequently, 

n+i / n _j_ 

(a + 6) w + 1 = (a + 6)(a + 6)« = 2 ( r Ja*- r+1 br 

which was to be proved. 


a n-r + l fcr 


2.19, Compute the following multinomial coefficients: 


^ (3, 2, 1) * ^ (4,2,2,0) ’ ( U1 ) (5,3,2, 


6 \ _ _ 6! _ 6-5-4-3-2-1 

3,2,1/ 3! 2! 1! 3*2»1»2*1*1 


8*7*6*5*4*3*2*1 


4 , 2 , 2 , 0 J 4! 2 ! 2 ! 0 ! 4*3*2*1*2*1*2*1*1 


= 420 


(iii) The expression f j has no meaning since 5 + 34-2 + 2^ 10. 

\5, 3, 2, 2/ 


COMBINATIONS 

2.20. In how many ways can a committee consisting of 3 men and 2 women be chosen 
from 7 men and 5 women? 

The 3 men can be chosen from the 7 men in ^ ways, and the 2 women can be chosen from the 
5 women in f ^ ways. Hence the committee can be chosen in = - J * ~1 ^ ~ = 350 ways. 


2.21. A delegation of 4 students is selected each year from a college to attend the National 
Student Association annual meeting, (i) In how many ways can the delegation be 
chosen if there are 12 eligible students? (ii) In how many ways if two of the 
eligible students will not attend the meeting together? (iii) In how many ways if 
two of the eligible students are married and will only attend the meeting together? 


(i) The 4 students can be chosen from the 12 students in 


12 • 11 • 10 • 9 _ 

= T-2-‘3-4" = 495 wa y s * 


(ii) Let A and B denote the students who will not attend the meeting together. 

Method 1. , ■> 

If neither A nor B is included, then the delegation can be chosen in ( 


/10 \ 10*9*8*7 

If neither A nor B is included, then the delegation can be chosen in ( 1 = 1 . g . 3.4 = 

210 ways. If either A or B , but not both, is included, then the delegation can be chosen in 


10*9*8 


210 + 240 = 450 ways. 


240 ways. Thus, altogether, the delegation can be chosen in 


Method 2. 

If A and B are both included, then the other 2 members of the delegation can be chosen in 

/io\ 

\ 2 / ~ ^ ways. Thus there are 495 — 45 = 450 ways the delegation can he chosen if 
A and B are not both included. 

(iii) Let C and D denote the married students. If C and D do not go, then the delegation can be 

/io\ 

chosen in ( J = 210 ways. If both C and D go, then the delegation can be chosen in 

Ao\ w 

[ ) = 45 ways. Altogether, the delegation can be chosen in 210 + 45 = 255 ways. 
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2.22. A student is to answer 8 out of 10 questions on an exam, (i) How many choices 
has he? (ii) How many if he must answer the first 3 questions? (iii) How many 
if he must answer at least 4 of the first 5 questions? 

(i) The 8 questions can be selected in = f = 45 ways. 


(ii) 

(iii) 


If he answers the first 3 questions, then he can choose the other 5 questions from the last 


7 questions in 


7-6 

fTJ = 21 ways * 


If he answers all the first 5 questions, then he can choose the other 3 questions from the last 

V 


5 in 


= 10 ways. On the other hand, if he answers only 4 of the first 5 questions, then 


he can choose these 4 in 


= 5 ways, and he can choose the other 4 questions 


from the last 5 in 


— 5 ways; hence he can choose the 8 questions in 5*5 — 


25 ways. Thus he has a total of 35 choices. 


2.23. Find the number of subsets of a set X containing n elements. 

Method 1. 

The number of subsets of X with r — n elements is given by 

o) + (l) + ( 2 ) + ••• + («-: 

subsets of X. The above sum (Problem 2.51) is equal to 2 n , i.e. there are 2 n subsets of X. 

Method 2. 

There are two possibilities for each element of X: either it belongs to the subset or it doesn't; 
hence there are 

n times 

_A_ 

2*2 . 2 = 2 n 

ways to form a subset of X y i.e. there are 2 n different subsets of X. 


. Hence, altogether, there are 


+ 


2.24. In how many ways can a teacher choose one or more students from six eligible 
students ? 

Method 1. 

By the preceding problem, there are 2 6 = 64 subsets of the set consisting of the six students. 
However, the empty set must be deleted since one or more students are chosen. Accordingly there 
are 2 6 — 1 = 64 — 1 = 63 ways to choose the students. 

Method 2. 

Either 1,2, 3,4, 5 or 6 students are chosen. Hence the number of choices is 
(!) + (!) + (;) + (;) + (;) + (!) = • +«+ z ° + 15 + 6 +1 - 63 

ORDERED AND UNORDERED PARTITIONS 

2.25. In how many ways can 7 toys be divided among 3 children if the youngest gets 
3 toys and each of the others gets 2? 

We seek the number of ordered partitions of 7 objects into cells containing 3, 2 and 2 objects, 

7! 

respectively. By Theorem 2.9, there are g f g T gi = 210 such partitions. 




30 


TECHNIQUES OF COUNTING 


[CHAP. 2 


2.26. There are 12 students in a class. In how many ways can the 12 students take 
3 different tests if 4 students are to take each test? 


Method 1. 

We seek the number of ordered partitions of the 12 students into cells containing 4 students 

12 1 

each. By Theorem 2.9, there are ^ 7 ^ y = 34,650 such partitions. 


Method 2. 

There are 

8 " 


12 


ways to choose 4 students to take the first test; following this, there are 


ways to choose 4 students to take the second test. The remaining students take the third test. 


Thus, altogether, there are 


= 495 • 70 = 34,650 ways for the students to take the tests. 


2.27. In how many ways can 12 students be partitioned into 3 teams, Ai, A 2 and As, so that 
each team contains 4 students? 

Method 1. 

Observe that each partition {A t ,A 2> A A of the students can be arranged in 3! - 6 ways 

12 ! 

as an ordered partition. Since (see the preceding problem) there are ^ j 4 ? 4 j — 34,650 such ordered 
partitions, there are 34,650/6 = 5775 (unordered) partitions. 


Method 2. 

Let A denote one of the students. Then there are 


ways to choose 3 other students to be 


on the same team as A. Now let B denote a student who is not on the same team as A; then there 
are ways to choose 3 students of the remaining students to be on the same team as B. The 


remaining 4 students constitute the third team. Thus, altogether, there are 
165 • 35 = 5775 ways to partition the students. 


11 


2.28. 


Prove Theorem 2.9: Let A contain n elements and let Wi, U 2 , ..., n r be positive integers 
with m 4- tt 2 + * * • + n T = n. Then there exist 

_ nl _ 

tti! ti 2 ! na ! * • • n r ! 


different ordered partitions of A of the form {Ai, A 2 , ... ,A r ) where A% contains ni 
elements, A 2 contains n 2 elements, . .., and A r contains n r elements. 

We begin with n elements in A; hence there are ( W ) ways of selecting the cell A v Following 

W /»-*A , , . 

this, there are n — elements left, i.e. in A\A X , and so there are ( ^ ) ways of selecting 

A 2 . Similarly, for i = 3, there are ( 1 ) ways of selecting A { . Thus 

there are ' 1 



different ordered partitions of A. Now (*) is equal to 

n \ — (n — % — - • * 

nxl (n — nx )! ’ n 2 \ (n — n x — n 2 )! n r \ (n — — • * • — n r )! 


But this is equal to --- 7 since each numerator after the first is cancelled by the second 

rixl n 2 l m ’' n r ' 

term in the denominator and since (n — ~~ • * * — 7i r )! = 0! = 1. Thus the theorem is proved. 
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TREE DIAGRAMS 

2.29. Construct the tree diagram for the number of permutations of {a, b, c}. 



abc 

acb 

bac 

bca 

cab 

cba 


The six permutations are listed on the right of the diagram. 


2.30. A man has time to play roulette at most five 
times. At each play he wins or loses a dollar. 
The man begins with one dollar and will stop 
playing before the five times if he loses all his 
money or if he wins three dollars, i.e. if he has 
four dollars. Find the number of ways that the 
betting can occur. 

The tree diagram on the right describes the way the 
betting can occur. Each number in the diagram denotes 
the number of dollars the man has at that point. Observe 
that the betting can occur in 11 different ways. Note 
that he will stop betting before the five times are up in 
only three of the cases. 




I 

I Supplementary Problems 


FACTORIAL 

2.31. Compute: 

2.32. Compute: 

! 

2.33. Simplify: 


(i) 9!, (ii) 10!, (hi) 11! 


16! 


14! 


8! 


10 ! 


(i) jjj, (ii) jjf, (iii) jjy, (iv) jgj. 


n (n + l)l .... 

(0 “.I ■”> (“) 


n ! 


(n - 2)! ’ 


,..., («— U! 

(,11) QT+2)T’ 


(iv) 


(n — r -f 1) l 
(n — r — 1)! * 


PERMUTATIONS 

2.34. (i) How many automobile license plates can be made if each plate contains 2 different letters 

followed by 3 different digits? (ii) Solve the problem if the first digit cannot be 0. 


2.35. There are 6 roads between A and B and 4 roads between B and C. 

(i) In how many ways can one drive from A to C by way of B? 

(ii) In how many ways can one drive roundtrip from A to C by way of R? 

(iii) In how many ways can one drive roundtrip from A to C without using the same road more 
than once? 


I 


oc<iwti<oc^c<i^ 

V V V V 
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2.36. Find the number of ways in which 6 people can ride a toboggan if one of three must drive. 

2.37. (i) Find the number of ways in which five persons can sit in a row. 

(ii) How many ways are there if two of the persons insist on sitting next to one another? 

2.38. Solve the preceding problem if they sit around a circular table. 

2.39. (i) Find the number of four letter words that can be formed from the letters of the word HISTORY, 

(ii) How many of them contain only consonants? (iii) How many of them begin and end in a 

consonant? (iv) How many of them begin with a vowel? (v) How many contain the letter Y? 
(vi) How many begin with T and end in a vowel? (vii) How many begin with T and also contain S? 
(viii) How many contain both vowels? 

2.40. How many different signals, each consisting of 8 flags hung in a vertical line, can be formed from 
4 red flags, 2 blue flags and 2 green flags? 

2.41. Find the number of permutations that can be formed from all the letters of each word: (i) queue, 
(ii) committee, (iii) proposition, (iv) baseball. 

2.42. (i) Find the number of ways in which 4 boys and 4 girls can be seated in a row if the boys and 

girls are to have alternate seats. 

(ii) Find the number of ways if they sit alternately and if one boy and one girl are to sit in 
adjacent seats. 

(iii) Find the number of ways if they sit alternately and if one boy and one girl must not sit in 
adjacent seats. 

2.43. Solve the preceding problem if they sit around a circular table. 

2.44. An urn contains 10 balls. Find the number of ordered samples (i) of size 3 with replacement, 
(ii) of size 3 without replacement, (iii) of size 4 with replacement, (iv) of size 5 without replacement. 

2.45. Find the number of ways in which 5 large books, 4 medium-size books and 3 small books can be 
placed on a shelf so that all books of the same size are together. 

2.46. Consider all positive integers with 3 different digits. (Note that 0 cannot be the first digit.) 
(i) How many are greater than 700? (ii) How many are odd? (iii) How many are even? (iv) How 
many are divisible by 5? 

2.47. (i) Find the number of distinct permutations that can be formed from all of the letters of the word 
ELEVEN, (ii) How many of them begin and end with E? (iii) How many of them have the 3 E's 
together? (iv) How many begin with E and end with N? 
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COMBINATIONS 

A class contains 9 boys and 3 girls, (l) In how many ways can the teacher choose a committee of 4? 
one girl? COnta]n at ^ ° ne ^ (iii > How of them will contain exactly 


2.56. 


2.57. 


2.58. 


2.59. 


2.60. 


an has 11 close friends, (i) In how many ways can she invite 5 of them to dinner? (ii) In 
ow many ways if two of the friends are married and will not attend separately? (iii) In how 
many ways if two of them are not on speaking terms and will not attend together? 

There are 10 points A, B in a plane, no three on the same line, (i) How many lines are 
determined by the points? (ii) How many of these lines do not pass through A ,i) How 

pointirSn aredeter nr d by thepoints? (iv) Jfi ,Z 

point A . (v) How many of these triangles contain the side AB*> 

manvtf n L iS n!?, U X*? I* ° n an <*> many choices has he? (ii) How 

many if he must answer the first two questions? (iii) How many if he must answer the first or 

second question but not both? (iv) How many if he must answer exactly 3 of the first 5 questions’ 
(v) How many if he must answer at least 3 of the first 5 questions? questions. 

t7V S /l ea \ a P0 ^! r a haad (5 cards) from an ordinary playing deck. In how many ways can he 
MMt At ‘ ■ ( ’ a kin<1 ' eii > “ <*> ■ of (v) tao of . 

The English alphabet has 26 letters of which 5 are vowels. 


(i) 

(ii) 

(iii) 

(iv) 

(v) 

(vi) 

(vii) 
(viii) 

(ix) 

(x) 


?ormed? ny 5 W ° rdS C ° ntaining 3 different consonants and 2 different vowels can be 

How many of them contain the letter 6? 

How many of them contain the letters b and c? 

How many of them begin with b and contain the letter c ? 

How many of them begin with b and end with c? 

How many of them contain the letters a and 6? 

How many of them begin with a and contain b ? 

How many of them begin with b and contain a? 

How many of them begin with a and end with 6? 

How many of them contain the letters a , b and c? 


ORDERED AND UNORDERED PARTITIONS 

2.61. In how many ways can 9 toys be divided evenly among 3 children? 

2.62. In how many ways can 9 students be evenly divided into three teams? 

2-63 ‘ WayS Can 10 StUdentS be diVided int ° three teams ’ ° ne coding 4 students and 


2.64. There are 12 balls in an urn. In how many ways can 3 balls be drawn from the urn, four times 
m succession, all without replacement? 


2.65. 

2 . 66 . 
2.67. 


12 members be partitioned int0 three committees containing 
In how many ways can n students be partitioned into two teams containing at least one student? 
3^^^the W others a 2’ 14 m6n ^ Partiti ° ned int ° 6 committees wher ® 2 of the committees contain 


TREE DIAGRAMS 

2.68. Construct the tree diagram for the number of permutations of {a, b, c, d}. 

2.69. Find the product set {1,2,3} X {2,4} X {2,3,4} by constructing the appropriate tree diagram. 
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2.70. Teams A and B play in a basketball tournament. The first team that wins two games in a row or a 

total of four games wins the tournament. Find the number of ways the tournament can occur. 

2.71. A man has time to play roulette five times. He wins or loses a dollar at each play. The man begins 

with two dollars and will stop playing before the five times if he loses all his money or wins three 
dollars (i.e. has five dollars). Find the number of ways the playing can occur. 

2.72. A man is at the origin on the a?-axis and takes a unit step either to the left or to the right. He 
stops after 5 steps or if he reaches 3 or —2. Construct the tree diagram to describe all possible 
paths the man can travel. 

2.73. In the following diagram let A,B , . ..,F denote islands, and the lines connecting them bridges. 

A man begins at A and walks from island to island. He stops for lunch when he cannot continue 

to walk without crossing the same bridge twice. Find the number of ways that he can take his walk 
before eating lunch. 



2.74. Consider the adjacent diagram with nine points A, B, C, R , S, T, X, 
Y, Z. A man begins at X and is allowed to move horizontally or ver¬ 
tically, one step at a time. He stops when he cannot continue to walk 
without reaching the same point more than once. Find the number of 
ways he can take his walk, if he first moves from X to R . (By sym¬ 
metry, the total number of ways is twice this.) 



C 

T 



Z 


Answers to Supplementary Problems 


2.31. 

2.32. 

2.33. 

2.34. 

2.35. 

2.36. 

2.37. 

2.38. 

2.39. 

2.40. 

2.41. 


(i) 362,880 (ii) 3,628,800 (iii) 39,916,800 
(i) 240 (ii) 2184 (iii) 1/90 (iv) 1/1716 

(i) n + 1 (ii) n(n -l)=n 2 -n (iii) 1 J[n{n + 1 )(n + 2)] (iv) (n - r)(n - r + 1) 

(i) 26 • 25 • 10 • 9 • 8 - 468,000 (ii) 26 • 25 • 9 • 9 • 8 = 421,200 

(i) 6 • 4 = 24 (ii) 6 • 4 • 4 • 6 = 24 • 24 = 576 (iii) 6 • 4 • 3 • 5 = 360 

3 • 5 • 4 • 3 • 2 • 1 = 360 


(ii) 4 • 2! • 3! — 48 


(i) 5! = 120 
(i) 4! = 24 (ii) 2! 3! 

(i) 7 • 6 • 5 • 4 = 840 

(ii) 5 • 4 • 3 • 2 = 120 


= 12 

(iii) 5 • 5 • 4 • 4 = 400 

(iv) 2 • 6 • 5 • 4 = 240 


(v) 4 • 6 * 5 • 4 — 480 

(vi) 1 • 5 • 4 • 2 = 40 


(vii) 1 • 3 • 5 • 4 = 60 
(viii) 4 • 3 • 5 • 4 = 240 


8 ! 


4! 2! 2! 


= 420 


(i) 27^7 = 30 (ii) grfl 21 = 45,360 (iH) 2! 34 2“! = 1^63,200 (iv) 2 ! 2! 2! “ 5040 


9! 


11 ! 


8! 
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2.42. 

(i) 

2 * 4!* 4! 

= 1152 

(ii) 

2 • 7 • 

3! *3! = 

= 504 


(iii) 1152 — 

2.43. 

(i) 

00 

II 

144 (ii) 

2* 

3! *3! 

= 72 

(iii) 

144 - 72 = 72 

2.44. 

(i) 

lo-io*: 

10 = 1000 


(iii) 

10*10 

•10* 

10 

= 10,000 


(ii) 

10*9*8 

= 720 


(iv) 

10*9* 

8*7< 

>6 

= 30,240 

2.45. 

3! 5! 4! 3! = 

103,680 








2.46. (i) 3*9*8 = 216 (ii) 8*8*5 - 320 

(iii) 9*8*1 = 72 end in 0, and 8*8*4 = 256 end in the other even digits; hence, altogether, 
72 + 256 = 328 are even. 

(iv) 9*8*1 = 72 end in 0, and 8*8*1 = 64 end in 5; hence, altogether, 72 + 64 = 136 are 
divisible by 5. 


2.47. 

(i) 

6! 

3! 

120 (ii) 

<N 

II 

(iii) 4*3! 

= 24 (iv) 

4! 

2! = 12 

2.48. 

(i) 

10 

(ii) 35 

(iii) 91 

(iv) 15 

(v) 1140 

(vi) 816 

2.49. 

(i) 

504 

(ii) 210 

(iii) 180 




2.50. 

(i) 

Sx 3 

+ 12 x 2 y 2 + 6xy 4 + y 3 





(ii) x 8 — 12 x 6 y + 54# 4 2/ 2 — 108z 2 2/ 3 + 81 y 4 

(iii) a 5 /32 + 5a 4 6/8 + 5a 3 6 2 + 20a 2 6 3 + 40ab 4 + 326 5 

(iv) 64a 12 - 192a 10 6 + 240a 3 6 2 - 160a 6 6 3 + 60a 4 6 4 - 12a 2 6 5 + 

2.51. Hint. Expand (1 + l) n . 2.53. 70 x 8 y 12 

2.52. Hint. Expand (1 — l) n . 2.54. 945 x 3 y 6 z 8 

2M. 0) (“) = 495, «') (?) - (J) = 

2,.«. in (“) = 462, <»>(;)+(;) = « 

2.57. (i) (^j ~ 45 ’ ( fi > (2) = 28 ’ (ii *^ 

2.5*. 0) (“) = (") = 286 

« (s) - (s) - » 

<“» 2 • (”) = 2 • ( s ) = 110 


369, (iii) 3 • ( ) = 252 



2.59. 


2.60. 


(i) 4*10 = 40, 
flushes.) (iv) 



(ii) 13*48 = 624, (iii) 10 • 4 5 — 40 = 10,200. (We subtract the number of straight 

©(s 2 ) ’ 43 = 84 ’ 480 ’ (v) 13 ‘ (SSXs 2 ) * 43 = ^9S,240 

4 • X X 3 * ~ 
4*19*5! = 9120 


5! = 22,800 

(vii) 

/ 20 

4 -U 

j • 4! = 18,240 

4! = 4560 

(viii) 

18,240 

(same as (vii)) 


5! 

= 1,596,000 

(v) 


1*3! 

= 1140 

(ix) 

5! 

= 228,000 

(Vi) 

4-W 

• 5! 

= 91,200 

(X) 
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2.61. 

2.62. 


2.63. 


3! 3!3! ~ 1680 
1680/3! = 280 or 


10! 1 
4 ! 3 ! 3 ! ' 2 ! 


2100 



2.64. 


12! 

3! 3!3! 3! 


= 369,600 


2 . 66 . 2 n ~ 1 — 1 


2.65. 


12 ! 

5! 4!3! 


27,720 


2.67. 


14! 1 

3! 3! 2! 2! 2! 2! * 2! 4! 


3,153,150 


2.69. 



( 1 . 2 , 2 ) 
(1. 2, 3) 
(1, 2, 4) 

(1, 4, 2) 
(1, 4, 3) 
(1, 4, 4) 

( 2 , 2 , 2 ) 
(2, 2, 3) 
(2, 2, 4) 

(2, 4, 2) 
(2, 4, 3) 
(2, 4, 4) 

(3, 2, 2) 
(3, 2, 3) 
(3, 2, 4) 

(3, 4, 2) 
(3, 4, 3) 
(3, 4, 4) 


The eighteen elements of the product set are listed to the right of the tree diagram. 


2.70. 14 ways 


2.71. 20 ways (as seen in the following diagram): 



I 
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2.72. Hint. The tree is essentially the same as the tree of the preceding problem. 

2.73. The appropriate tree diagram follows: 



There are eleven ways to take his walk. Observe that he must eat his lunch at either B, D or E. 

2.74. The appropriate tree diagram follows: 



There are 10 different trips. (Note that in only 4 of them are all nine points covered.) 



Chapter 3 


Introduction to Probability 


INTRODUCTION 

Probability is the study of random or nondeterministic experiments. If a die is tossed 
in the air, then it is certain that the die will come down, but it is not certain that, say, 
a 6 will appear. However, suppose we repeat this experiment of tossing a die; let s be 
the number of successes, i.e. the number of times a 6 appears, and let n be the number of 
tosses. Then it has been empirically observed that the ratio / — s/n, called the relative 
frequency, becomes stable in the long run, i.e. approaches a limit. This stability is the 
basis of probability theory. 

In probability theory, we define a mathematical model of the above phenomenon by 
assigning “probabilities” (or: the limit values of the relative frequencies) to the “events” 
connected with an experiment. Naturally, the reliability of our mathematical model for a 
given experiment depends upon the closeness of the assigned probabilities to the actual 
relative frequency. This then gives rise to problems of testing and reliability which form 
the subject matter of statistics. 

Historically, probability theory began with the study of games of chance, such as 
roulette and cards. The probability p of an event A was defined as follows: if A can occur 
in s ways out of a total of n equally likely ways, then 

* = P(A) = | 

For example, in tossing a die an even number can occur in 3 ways out of 6 “equally likely” 
ways; hence p = § = i. This classical definition of probability is essentially circular since 
the idea of “equally likely” is the same as that of “with equal probability” which has not 
been defined. The modern treatment of probability theory is purely axiomatic. This means 
that the probabilities of our events can be perfectly arbitrary, except that they must satisfy 
certain axioms listed below. The classical theory will correspond to the special case of 
so-called equiprobable spaces . 


SAMPLE SPACE AND EVENTS 

The set S of all possible outcomes of some given experiment is called the sample space . 
A particular outcome, i.e. an element in S, is called a sample point or sample. An event A 
is a set of outcomes or, in other words, a subset of the sample space S. The event {a} con¬ 
sisting of a single sample a G S is called an elementary event. The empty set 0 and S 
itself are events; 0 is sometimes called the impossible event, and S the certain or sure event. 

We can combine events to form new events using the various set operations: 

(i) AuB is the event that occurs iff A occurs or B occurs (or both); 

(ii) AnB is the event that occurs iff A occurs and B occurs; 

(iii) A c , the complement of A, is the event that occurs iff A does not occur. 
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Two events A and B are called mutually exclusive if they are disjoint, i.e. if AnB = 0. 
In other words, A and B are mutually exclusive if they cannot occur simultaneously. 


Example 3.1: Experiment: Toss a die and observe the number that appears on top. Then the 
sample space consists of the six possible numbers: 

S = {1, 2, 3, 4, 5, 6} 

Let A be the event that an even number occurs, B that an odd number occurs and 
C that a prime number occurs: 

A = {2,4,6}, B = {1,3,5}, C = {2,3,5} 

Then: 

AuC = {2, 3, 4, 5, 6} is the event that an even or a prime number occurs; 

BnC — {3, 5} is the event that an odd prime number occurs; 

C c = {1, 4, 6} is the event that a prime number does not occur. 

Note that A and B are mutually exclusive: AnB = 0; in other words, an even 
number and an odd number cannot occur simultaneously. 


Example 3.2: Experiment: Toss a coin 3 times and observe the sequence of heads (H) and 

tails (T) that appears. The sample space S consists of eight elements: 

S = {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT} 

Let A be the event that two or more heads appear consecutively, and B that all 
the tosses are the same: 

A = {HHH, HHT, THH} and B — (HHH, TTT} 

Then AnB = {HHH} is the elementary event in which only heads appear. The 
event that 5 heads appear is the empty set 0. 


Example 3.3: Experiment: Toss a coin until a head appears and then count the number of times 
the com was tossed. The sample space of this experiment is S = {1, 2,3, ..»}. 
Here « refers to the case when a head never appears and so the coin is tossed an 
infinite number of times. This is an example of a sample space which is countably 
infinite. 


Example 3.4: Experiment: Let a pencil drop, head first, into a 

rectangular box and note the point on the bottom 
of the box that the pencil first touches. Here S 
consists of all the points on the bottom of the box. 
Let the rectangular area on the right represent 
these points. Let A and B be the events that the 
pencil drops into the corresponding areas illus¬ 
trated on the right. This is an example of a sam¬ 
ple space which is not finite nor even countably 
infinite, i.e. which is uncountable. 



Remark: If the sample space S is finite or countably infinite, then every subset of S is 
an event. On the other hand, if S is uncountable, as in Example 3.4, then for 
technical reasons (which lie beyond the scope of this text) some subsets of S 
cannot be events. However, in all cases the events shall form a <r-algebra £ of 
subsets of S. 
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AXIOMS OF PROBABILITY 

Let S be a sample space, let £ be the class of events, and let P be a real-valued function 
defined on £. Then P is called a pi'obability function , and P(A) is called the probability of 
the event A if the following axioms hold: 

[Pi] For every event A, 0 ^ P(A) ^ 1. 

[Ps] P(S) = 1. 

[P 3 ] If A and B are mutually exclusive events, then 

P(AuP) = P(A) + P(P) 

[P 4 ] If Ai, A 2 , ... is a sequence of mutually exclusive events, then 

P(A 1 UA 2 U**‘) = P(Ai)+P(A 2 ) + 

The following remarks concerning the axioms [P 3 ] and [P 4 ] are in order. First of all, 
using [P 3 ] and mathematical induction we can prove that for any mutually exclusive events 

P(AxUA 2 U * * • UA n ) = P(Ai) +P(A 2 ) + ■ • • + P(A«) (*) 

We emphasize that [P 4 ] does not follow from [P 3 ] even though (*) holds for every positive 
integer n . However, if the sample space S is finite, then clearly the axiom [P 4 ] is superfluous. 

We now prove a number of theorems which follow directly from our axioms. 

Theorem 3.1: If 0 is the empty set, then P(0) = 0. 

Proof : Let A be any set; then A and 0 are disjoint and AU0 — A. By [P 3 ], 

P(A) = P(AU0) = P(A)+P(0) 

Subtracting P(A) from both sides gives our result. 

Theorem 3.2: If A c is the complement of an event A, then P(A C ) = 1 — P(A). 

Proof : The sample space S can be decomposed into the mutually exclusive events A and 
A c ; that is, S = AuA c . By [P 2 ] and [P 3 ] we obtain 

1 = p(S) = P(A U A c ) = P(A) + P(A*) 

from which our result follows. 

Theorem 3.3: If AcP, then P(A) ^ P(P). 

Proof . If A CP, then B can be decomposed into the mutually 
exclusive events A and B\A (as illustrated on the right). 

Thus 

P(B) = P(A) + P{B\ A) 

The result now follows from the fact that P(B \ A) — 0. 

Theorem 3.4: If A and B are any two events, then 
P(A\B) = P(A) - P(AnB) 

Proof . Now A can be decomposed into the mutually ex¬ 
clusive events A\B and AnP; that is, A = (A\P)U(AnP). 

Thus by [P 3 ], 

P(A) = P(A\B) + P{AHB) 
from which our result follows. 
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Theorem 3.5: If A and B are any two events, then 

P(A U B) = P(A) + P(B) - P(A riB) 

Proof. Note that A UB can be decomposed into the mutually 
exclusive events A\B and B\ that is, AUB = (A\B)UB. Thus 
by [P 3 ] and Theorem 3.4, 

P{A\JB) = P(A\B) + P(B) 

= P{A) - P(AnB) + P(B) 

= P(A) + P(B) — P(AnB) 

which is the desired result. 

Applying the above theorem twice (Problem 3.23) we obtain 
Corollary 3.6: For any events A, B and C, 

P(A UBUC) = P{A) + P(B) + P(C) ~ P(AHB) - P(Af)C) - P(BnC) + P(AnBDC) 

FINITE PROBABILITY SPACES 

Let , S u be a finite sam P le s P ace ; say. s = { 01 , 02 , .. .,o»). A finite probability space is 
o tamed by assigning to each point Oi€S a real number p it called the probability of a f 
satisfying the following properties: * 

(i) each p ; is nonnegative, p t Ss 0 

(ii) the sum of Ihe Pi is one, Pi + p 2 + • ■ • + p„ = 1. 

The probability P(A) of any event A, is then defined to be the sum of the probabilities of 
the points m A. For notational convenience we write P(a t ) for P({Oi}). 

Example 3.5: Let three coins be tossed and the number of heads observed; then the sample space 
is S - {0,1,2, 3}. We obtain a probability space by the following assignment 

P (°) = i> p (!) = f. P( 2) = f and P(3) = £ 

since each probability is nonnegative and the sum of the probabilities is 1. Let A 
be the event that at least one head appears and let B be the event that all heads 
or all tails appear: 

A = {1,2,3} and B = {0,3} 

Then, by definition, 

P(A) = P(l) + P(2) + P(3) = ! + § + £ = £ 
an d P(B) = P(0) + P(3) = ^ + ^ = 1 

Example 3.6: Three horses A, B and C are in a race; A is twice as likely to win as B and B is 
hkely to^win as C. What are their respective probabilities of winning, 

Let P(C) = p; since B is twice as likely to win as C, P(B) = 2 p; and since 
A is twice as likely to win as B, P(A) = 2 P(B) = 2(2p) = 4p. Now the sum of the 
probabilities must be 1; hence 

P d - + 4 p = 1 or 7 p ~ 1 or p — i 

Accordingly, 

P(A) = 4p = i, P(B) = 2p = P(C) = p = i 

Question: What is the probability that B or C wins, i.e. P({B, C})? By definition 
P({B, C}) = P(B) + P(C) = | + 1 = a 



AuB is shaded. 
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FINITE EQUIPROBABLE SPACES 

Frequently, the physical characteristics of an experiment suggest that the various 
outcomes of the sample space be assigned equal probabilities. Such a finite probability 
space S, where each sample point has the same probability, will be called an equiprobable 

or uniform space . In particular, if S contains n points then the probability of each point 

1 r 

is 1 In. Furthermore, if an event A contains r points then its probability is r • — = —. In 
other words, 

p/4) — number of elements in A 

' number of elements in S 


P(a\ — number of ways that the event A can occur 
' number of ways that the sample space S can occur 

We emphasize that the above formula for P(A) can only be used with respect to an 
equiprobable space, and cannot be used in general. 

The expression “at random” will be used only with respect to an equiprobable space; 
formally, the statement “choose a point at random from a set S” shall mean that S is an 
equiprobable space, i.e. that each sample point in S has the same probability. 


Example 3.7: Let a card be selected at random from an ordinary deck of 52 cards. Let 

A ==. {the card is a spade} 

and B — {the card is a face card, i.e. a jack, queen or king} 


We compute P(A), P(B) and P(AnB). Since we have an equiprobable space, 


P(A) = 


number of spades __ 13 
number of cards 52 


1 

4 


P(B) = 


number of face cards 
number of cards 


12 _ _3_ 
52 ~ 13 


P(AnB) 


number of spade face cards _ 3 
number of cards 52 


Example 3.8: Let 2 items be chosen at random from a lot containing 12 items of which 4 are 

defective. Let 

A = {both items are defective} and B — {both items are non-defective} 
Find P(A) and P(B). Now 

S can occur in ) = 66 ways, the number of ways that 2 items can be 
chosen from 12 items; 

A can occur in (%) =6 ways, the number of ways that 2 defective items 

can be chosen from 4 defective items; 

B can occur in (®) = 28 ways, the number of ways that 2 non-defective 

items can be chosen from 8 non-defective items. 

Accordingly, P(A) = ^ = n and P(B) = f§ = if. 

Question: What is the probability that at least one item is defective? Now 

C = {at least one item is defective} 

is the complement of B; that is, C = B c . Thus by Theorem 3.2, 

P(C) = P(B C ) = 1 - P(B) = 1 -§ = i 

The odds that an event with probability p occurs is defined to be the ratio p : (1 — p). 
Thus the odds that at least one item is defective is ||: H or 19 :14 which is read 
“19 to 14”. 
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Example 3.9: 


(Classical Birthday Problem.) We seek the probability p that n people have dis- 
inct birthdays. In solving this problem, we ignore leap years and assume that a 
person s birthday can fall on any day with the same probability. 


Since there are n people and 365 different days, there are 365* ways in which 
the n people can have their birthdays. On the other hand, if the n persons are to 
ave distinct birthdays, then the first person can be born on any of the 365 days 
the second person can be born on the remaining 364 days, the third person can be 
born on the remaining 363 days, etc. Thus there are 365 • 364 • 363 * * * (365 — n + 1) 
ways the n persons can have distinct birthdays. Accordingly, 


P 


365 • 364 • 363 • • • (365 - n + 1) 
365* 


365 

365 


554 

365 


363 

365 


365 


It can be shown that for n - 23, p < . i; in other words, amongst 23 or more 
people it is more likely that at least two of them have the same birthday than 
that they all have distinct birthdays. 


INFINITE SAMPLE SPACES 

Now suppose S is a countably infinite sample space; say S = {a h a 2 , . ..}. As in the 

i “pSiS.°^“tLt robaMity space by asstoing to each * e s ° real number called 

(i) Pi — 0 and (ii) pi + p 2 + • • • = jr P* = 1 
The probability P(A) of any event A is then the sum of the probabilities of its points. 


Example 3.10: 


Cons ider the sample space S = (1,2,3, ..., »} of the experiment of tossing a 
com till a head appears; here n denotes the number of times the coin is tossed 
A probability space is obtained by setting 


P( 1) 


= b P(2) = A 


p(n) = 1/2", 


P(°°) = 0 


The only uncountable sample spaces S which we will consider here are those with some 
finite geometrical measurement m(S) such as length, area or volume, and in which a point 
is selected at random. The probability of an event A, i.e. that the selected point belongs 
to A, is then the ratio of m(A) to m(S); that is, 

PC 1 \ _ area of A volume of A 

F{A) ~ area of S or P ( A ) 


p( _ length of A 
' ' ~ length of S 


or 


Such a probability space is said to be uniform. 


volume of S 


Example 3J1: On the real line R, points a and b are selected at random such that — 2 ^ b ^ 0 

and 0 ^ a ^ 3, as shown below. Find the probability p that the distance d be- 
tween a and b is greater than 3. 


h-' d 

6 S 


a 


The sample space S consists of the or¬ 
dered pairs (a, b ) and so forms the rectangu¬ 
lar region shown in the adjacent diagram. 
On the other hand, the set A of points (a, b) 
for which d = a — 6 > 3 consists of those 
points of S which lie below the line x — y — 3, 
and hence forms the shaded area in the 
diagram. Thus 

p - r(A) - areaof5 -6 = 3 



Remark: A finite or countably infinite probability space is said to be discrete, and an 
uncountable space is said to be nondiscrete. 
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Solved Problems 

SAMPLE SPACES AND EVENTS 

3.1. Let A and B be events. Find an expression and exhibit the Venn diagram for the 
event that: (i) A but not B occurs, i.e. only A occurs; (ii) either A or B, but not 
both, occurs, i.e. exactly one of the two events occurs. 

(i) Since A but not B occurs, shade the area of A outside of B as in Figure (a) below. Note 
that B c , the complement of B , occurs since B does not occur; hence A and B c occurs. In other 
words, the event is A n B c . 



(ii) Since A or B but not both occurs, shade the area of A and B except where they intersect as 
in Figure (b) above. The event is equivalent to A but not B occurs or B but not A occurs. 
Now, as in (i), A but not B is the event AnB c , and B but not A is the event BnA c . Thus 
the given event is (AnB c ) U (BnA c ). 


3.2. Let A, B and C be events. Find an expression and exhibit the Venn diagram for 
the event that (i) A and B but not C occurs, (ii) only A occurs. 

(i) Since A and B but not C occurs, shade the intersection of A and B which lies outside of C, 
as in Figure (a) below. The event is AnBnC c . 




(ii) Since only A is to occur, shade the area of A which lies outside of B and of C, as in Figure ( b ) 
above. The event is AnB c nC c . 


3.3. Let a coin and a die be tossed; let the sample space S consist of the twelve elements: 
S - {HI, H2, H3, H4, H5, H6, Tl, T2, T3, T4, T5, T6) 

(i) Express explicitly the following events: A = (heads and an even number ap¬ 
pear), B — {a prime number appears), C = (tails and an odd number appear). 

(ii) Express explicitly the event that: (a) A or B occurs, ( b ) B and C occurs, 
(c) only Z?. occurs. 

(iii) Which of the events A, B and C are mutually exclusive? 
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(i) To obtain A, choose those elements of S' consisting of an H and an even number: A = 
{H2,H4,H6}. 

To obtain B , choose those points in S consisting of a prime number: B = {H2, H3, H5, T2, T3, T5}. 
To obtain C, choose those points in S consisting of a T and an odd number: C = {T1,T3,T5}. 

(ii) (a) A or B = AuB = (H2, H4, H6, H3, H5, T2, T3, T5} 

(6) B and C = Bn C = {T3, T5} 

(c) Choose those elements of B which do not lie in A or C: Br\A c r\C c — (H3, H5, T2}. 

(iii) A and C are mutually exclusive since Ar\C ~ 0. 

FINITE PROBABILITY SPACES 

3.4. Suppose a sample space S consists of 4 elements: S= {di, d 2 , d 3 , a 4 }. Which func¬ 
tion defines a probability space on S? 

(i) P(ai) = b P(a 2 ) = b P(a 3 ) = i, P(a 4 ) = f 

(ii) P(oi) = i, P(a 2 ) = i, P(a 3 ) = -i, P(a 4 ) = *. 

(iii) P(ai) = i, P(a 2 ) = i, P(a 3 ) = £, P(a 4 ) = 

(iv) P(ai) = b P(a 2 ) = i, P(a 3 ) = i, P(a 4 ) = 0. 

(i) Since the sum of the values on the sample points is greater than one, i + 4 + i + i 
the function does not define a probability space on S. 

(ii) Since P(a 3 ) = — J, a negative number, the function does not define a probability space on S. 

(iii) Since each value is nonnegative, and the sum of the values is one, ^ £ -f J ^ = 1, the 
function does define a probability space on S. 

(iv) The values are nonnegative and add up to one; hence the function does define a probability 
space on S. 

3.5. Let S = (di, d 2 , d 3 , d 4 }, and let P be a probability function on S. 

(i) Find P(ai) if P(a 2 ) = P(a 3 ) = P(a 4 ) = b 

(ii) Find P(a 4 ) and P(a 2 ) if P(a 3 ) = P(a 4 ) = i and P(ai) = 2 P(a 2 ). 

(iii) Find P(ai) if P({a 2 , a 3 }) = f, P({a 2 , a 4 }) = | and P(a 2 ) = i. 

(i) Let P(a x ) = p. Then for P to be a probability function, the sum of the probabilities on the 
sample points must be one: ?>+ 3 - + ^ + ^ := l or P = Yb- 

(ii) Let P(a 2 ) — p, then P(ad = 2p. Hence 2p-fp + J + J = l or p = ^. Thus P(a 2 ) = ^ and 
P(a x ) = f 

(iii) Let P(a x ) = p. P(a 3 ) = P({a 2 , a 3 }) - P(a 2 ) = f i 

P( a 4 ) = P(i a 2» ® 4 }) P(®2) = £ £ = £ 

Then p+£+£+£=l or p = £, that is, P(ad = £. 

3.6. A coin is weighted so that heads is twice as likely to appear as tails. Find P(T) 
and P(H). 

Let P(T) = p; then P(H) = 2p. Now set the sum of the probabilities equal to one: p + 2p = 1 
or p = i. Thus P(T) = p = £ and P(H) = 2p = §. 
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3.7. Two men, mi and m 2 , and three women, w i, w 2 and wz, are in a chess tournament. 
Those of the same sex have equal probabilities of winning, but each man is twice 
as likely to win as any woman, (i) Find the probability that a woman wins the 
tournament, (ii) If mi and Wi are married, find the probability that one of them 
wins the tournament. 

Set P(w l ) — p; then P(w 2 ) = P(w 3 ) = p and P(m x ) = P(m 2 ) = 2 p. Next set the sum of the 
probabilities of the five sample points equal to one: p + p + p + 2p + 2p = l or p — 1. 

We seek (i ) P({w v w 2 ,w 3 }) and (ii) P({ra 1 ,w> 1 }). Then by definition, 

P({w u w 2 ,w 3 }) = P(w x ) + P(w 2 ) + P(w 3 ) = l + I+l = f 
P({m v w x }) = P(m x ) + P(w x ) = f + i = f 


3.8. Let a die be weighted so that the probability of a number appearing when the die 
is tossed is proportional to the given number (e.g. 6 has twice the probability of 
appearing as 3). Let A = {even number}, B — (prime number}, C ~ {odd number). 

(i) Describe the probability space, i.e. find the probability of each sample point. 

(ii) Find P(A), P(B) and P(C). 

(iii) Find the probability that: (a) an even or prime number occurs; ( b ) an odd 
prime number occurs; (c) A but not B occurs. 

(i) Let P(l) = p. Then P(2) = 2 p, P( 3) = 3 p, P(4) = 4 p, P(5) = 5 p and P(6) = 6p. Since the sum 
of the probabilities must be one, we obtain p 4- 2p + Sp + 4p + 5p + 6p = 1 or p = 1/21. Thus 

P( 1)=h, P(2)=iv P(*) = b P(*)=Ti’ P( V = h P( V = t 

(ii) P(A) = P({2,4,6}) = i, P(B) =>({2,3,5}) = §?, P{C) = P({1,3,5}) = f. 

(iii) (a) The event that an even or prime number occurs is AuB — (2, 4,6, 3, 5}, or that 1 does 

not occur. Thus P(AuP) = 1 — P(l) = ff. 

(6) The event that an odd prime number occurs is BnC — {3,5}. Thus P(BnC) = 
P({3,5})=£. 

(c) The event that A but not B occurs is AnB c = (4,6). Hence P{AnB c ) = P({4,6}) = |f- 


FINITE EQUIPROBABLE SPACES 

3.9. Determine the probability p of each event: 

(i) an even number appears in the toss of a fair die; 

(ii) a king appears in drawing a single card from an ordinary deck of 52 cards; 

(iii) at least one tail appears in the toss of three fair coins; 

(iv) a white marble appears in drawing a single marble from an urn containing 
4 white, 3 red and 5 blue marbles. 

(i) The event can occur in three ways (a 2, 4 or 6) out of 6 equally likely cases; hence p = f = 

(ii) There are 4 kings among the 52 cards; hence p — 52 = j 3 * 

(iii) If we consider the coins distinguished, then there are 8 equally likely cases: HHH,HHT, 
HTH, HTT, THH, THT, TTH, TTT. Only the first case is not favorable to the given event; 
hence p 

(iv) There are 4 + 3 + 5 = 12 marbles, of which 4 are white; hence p = ^ = f. 
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3.10. Two cards are drawn at random from an ordinary deck of 52 cards. Find the proba¬ 
bility p that (i) both are spades, (ii) one is a spade and one is a heart. 

There are ( 5 2 2 ) = 1326 ways to draw 2 cards from 52 cards. 

(i) There are (g 3 ) = 78 ways to draw 2 spades from 13 spades; hence 

_ number of ways 2 spades can be drawn _ 78 _ 

^ number of ways 2 cards can be drawn 1326 17 


(ii) Since there are 13 spades and 13 hearts, there are 13 • 13 = 169 ways to draw a spade and a 
heart; hence p = ^ 


3.11. Three light bulbs are chosen at random from 15 bulbs of which 5 are defective. 

Find the probability p that (i) none is defective, (ii) exactly one is defective, (iii) at 

least one is defective. 

There are (g 5 ) — 455 ways to choose 3 bulbs from the 15 bulbs. 

(i) Since there are 15 — 5 = 10 nondefective bulbs, there are (*®) = 120 ways to choose 3 non¬ 
defective bulbs. Thus p = . 

(ii) There are 5 defective bulbs and ( x ®) = 45 different pairs of nondefective bulbs; hence there 
are 5 • 45 = 225 ways to choose 3 bulbs of which one is defective. Thus p = |§§ = ||. 

(iii) The event that at least one is defective is the complement of the event that none are defective 
which has, by (i), probability f~. Hence p = 1 — f| = |j. 


3.12. Two cards are selected at random from 10 cards numbered 1 to 10. Find the proba¬ 
bility p that the sum is odd if (i) the two cards are drawn together, (ii) the two 
cards are drawn one after the other without replacement, (iii) the two cards are 
drawn one after the other with replacement. 

(i) There are (g°) = 45 ways to select 2 cards out of 10. The sum is odd if one number is odd 
and the other is even. There are 5 even numbers and 5 odd numbers; hence there are 
5 • 5 = 25 ways of choosing an even and an odd number. Thus p = f§ = §* 

(ii) There are 10 • 9 = 90 ways to draw two cards one after the other without replacement. 
There are 5 • 5 = 25 ways to draw an even number and then an odd number, and 5*5 = 25 
ways to draw an odd number and then an even number; hence p = — = §5 = f ■ 

(iii) There are 10 • 10 = 100 ways to draw two cards one after the other with replacement. As 
in (ii), there are 5*5 = 25 ways to draw an even number and then an odd number, and 
5 • 5 = 25 ways to draw an odd number and then an even number; hence p = 25 1 q Q 25 = ^ = j. 


3.13. Six married couples are standing in a room. 

(i) If 2 people are chosen at random, find the probability p that (a) they are 
married, ( b ) one is male and one is female. 

(ii) If 4 people are chosen at random, find the probability p that (a) 2 married 

couples are chosen, ( b ) no married couple is among the 4, (c) exactly one 

married couple is among the 4. 

(iii) If the 12 people are divided into six pairs, find the probability p that (a) each 
pair is married, ( b ) each pair contains a male and a female. 
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(i) There are (g 2 ) = 66 ways to choose 2 people from the 12 people. 

(а) There are 6 married couples; hence p = ~. 

( б ) There are 6 ways to choose a male and 6 ways to choose a female; hence p = ^ 

(ii) There are ( 4 2 ) = 495 ways to choose 4 people from the 12 people. 

(а) There are (|) — 15 ways to choose 2 couples from the 6 couples; hence p = ^ 

( б ) The 4 persons come from 4 different couples. There are ( 4 ) = 15 ways to choose 4 couples 

from the 6 couples, and there are 2 ways to choose one person from each couple. Hence 
_ 2-2*2.2«15 __ 16 
P 495 33* 

(c) This event is mutually disjoint from the preceding two events (which are also mutually 

disjoint) and at least one of these events must occur. Hence P + + |§ = 1 or P=|§* 

(iii) There are 2 ? 2 ! 2 ! 2 2 ! 2 ! 2 ; = W wa y s to partition the 12 people into 6 ordered cells with 2 people 
in each. 

(a) The 6 couples can be placed into the 6 ordered cells in 6 ! ways. Hence p = \ 2 02g ~ ioii95 • 

( b ) The six men can be placed one each into the 6 cells in 6 ! ways, and the 6 women can be 
placed one each into the 6 cells in 6 ! ways. Hence p = t |j ^ 6 = ^. 

3.14. A class contains 10 men and 20 women of which half the men and half the women 
have brown eyes. Find the probability p that a person chosen at random is a man 
or has brown eyes. 

Let A = {person is a man} and B = {person has brown eyes}. We seek P(AuB). 

Then P(A) = §, P(B) = §§ = P(A n B)=^ = Thus by Theorem 3.5, 

p = P(AuB) = P(A) + P(B) - P(AnB) = £ + = f 


UNCOUNTABLE UNIFORM SPACES 


3.15. 


A point is selected at random inside a circle. Find the 
probability p that the point is closer to the center of the 
circle than to its circumference. 


Let S denote the set of points inside the circle with radius r, 
and let A denote the set of points inside the concentric circle of 
radius \r. (Thus A consists precisely of those points of S which 
are closer to its center than to its circumference.) Accordingly, 


V = P(A) 


area of A ___ ^(^ r ) 2 _ 1^ 
area of S n-r 2 4 



3.16. 


Consider the Cartesian plane R 2 , and let X denote the subset of points for which 
both coordinates are integers. A coin of diameter | is tossed randomly onto the 
plane. Find the probability p that the coin covers a point of X. 


Let S denote the set of points inside a square with corners 
(m, n), (m 9 n+ 1), (m+1, n), (m + 1, €= X 

Let A denote the set of points in S with distance less than J from 
any corner point. (Observe that the area of A is equal to the area 
inside a circle of radius J.) Thus a coin whose center falls in S will 
cover a point of X if and only if its center falls in a point of A. 
Accordingly, 

area of A _ ^(i) 2 _ it _ 0 

16 


V = P(A) 


area of S 1 

Note . We cannot take 5 to be all of R 2 because the latter has 


infinite area. 
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3.17. Three points a, b and c are selected at random from the 
circumference of a circle. Find the probability p that the 
points lie on a semicircle. 

Suppose the length of the circumference is 2 s. Let x denote 
the clockwise arc length from a to 6 , and let y denote the clockwise 
arc length from a to c. Thus 


0 < x < 2s and 0 < y < 2 s (*) 

Let S denote the set of points in R 2 for which condition (*) holds. 

Let A denote the subset of S for which any of the following condi¬ 

tions holds: 

(i) x f y < s (iii) x < s and y — x > s 


(ii) x,y > s (iv) y < s and x — y > s 

Then A consists of those points for which a, b and c lie on a semi¬ 
circle. Thus _ . 0 9 „ 

a of A _ 3 s 2 _ 3 

a of S 4s 2 4 


P = 




A is shaded. 


MISCELLANEOUS PROBLEMS 

3.18. Let A and B be events with P(A) = f, P(B) = \ and P(Afl5) = £. Find (i) P(Al)B), 
(ii) P(A C ) and P(B C ), (iii) P(A c nB c ), (iv) P(A C UB C ), (v) P(AnB c ), (vi) P(PnA c ). 

(i) P(AuB) = P(A) + P(B) — P(AnB) = f + i-i = f 

(ii) P(A C ) = 1 - P(A) = 1 - | = | and P(B<=) = 1 - P(B) = 1 - \ \ 

(iii) Using De Morgan’s Law, (AkjB) c = A c r\B c , we have 

P(A c nB c ) = P((AdB) c ) = 1 — P(AuB) = 1-f = f 

(iv) Using De Morgan’s Law, (AnB) c = A c uB c , we have 

P(A'UB') = P((AnB) c ) = 1 -P(AnB) = l~l = | 

Equivalently, 

P(A'uB c ) = P(A') + P(B<=) - P(A'nBc) = f + i-f = | 

(v) P(AnB c ) = P(A \ B) = P(A)-P(AnB) = f-J = * 

(vi) P(BnAo) = P(B) — P(Af\B) = = J 

3.19. Let A and B be events with P(A U B) = §, P(A C ) = f and P(AnB) = J. Find 
(i)P(A), (ii)P(P), (iii) P(AnB‘). 

(i) P(A) = l-P(Ac) = 1-f = i 

(ii) Substitute in P(AuP) = P(A) + P(B) — P(AnB) to obtain 4 = ^ + P(B) — J or P(B) = 

(iii) P(AnB c ) = P(A) — P(AnB) = £ 

3.20. Find the probability p of an event if the odds that it will occur are a: b, that is, 
“a to b”. 

The odds that an event with probability p occurs is the ratio p : (1 — p). Hence 

pa , t a 

-—^— = t; or bp — a — ap or ap bp = a or p = —y~t 
1 — p o a \ o 

3.21. Find the probability p of an event if the odds that it will occur are “3 to 2”. 

= | from which p — §. We can also use the formula of the preceding problem to obtain 
the answer directly: p — = 3^2 = §• 
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3.22. A die is tossed 100 times. The following table lists the six numbers and frequency 
with which each number appeared: 


Number 

1 

2 

3 

4 

5 

6 

Frequency 

14 

17 

20 

18 

15 

1G 


Find the relative frequency / of the event (i) a 3 appears, (ii) a 5 appears, (iii) an 
even number appears, (iv) a prime appears. 


The relative frequency / 


number of successes 
total number of trials ' 


(i) / — 100 — .20 


(ii) / = i(Kj = *15 (iii) / = ■ 


.51 


0v) 


= .52 


3.23. Prove Corollary 3.6: For any events A, B and C, 

P(AuBuC) = P(A) + P(B) + P(C) - P(AnB) - P(AnC) - P{BnC) + P(AnBnC) 

Let D = BuC. Then A n D ~ A n {BuC) = (AnB) u (AnC) and 

P(AnD) = P(AnB) + P(AnC) - P(AnBnAnC) = P(A nB) + P(A nC) - P(AnBnC) 

Thus 

P(AuBuC) - P(AuD) = P(A) + P(D) - P(AnD) 

= P(A) + P(B) + P(C) - P(BnC) - [P(A n B) + P(A n C) - P(AnBnC)] 

= P(A) + P(B) + P(C) - P(BnC) - P(A n B) - P(AnC) + P(AnBnC) 


3.24. Let S — {a lf a 2f and T — 62 , be finite probability spaces. 

Let the number Pij = P(a,i)P(bj) be assigned to the ordered pair (a if bj) in the product 
set S x T = {(s, t) : s G S, t G T}. Show that the Pa define a probability space on 
SxT, i.e. that the Pa are nonnegative and add up to one. (This is called the product 
probability space . We emphasize that this is not the only probability function that 
can be defined on the product set S x T.) 

Since P(a i ) 7 P(bj) — 0, for each i and each j, = P{a^) P(bj) — 0. Furthermore, 

Pll + P 12 + ’ * * + Pit + P21 + P 22 + * * * + P2t + * * ' + Psl + Ps2 + * * * + Pst 
= P(a 1 )P(b 1 ) + •■• + P(a 1 )P(b t ) + ••• + P(a s )P( 6 1 ) + ••■ + P(o.) P( 6 t ) 

= P(«i)[P(6i) + ■•• + P(b t )] + •■• + P(a s )[P(b 1 ) + ■•• + P(b t )] 

= P(a x ) • 1 + ■ • • + P(a.) • 1 
= P(«i) + • ■ • + P(a„) 


X 
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Supplementary Problems 

SAMPLE SPACES AND EVENTS 

3.25. Let A and B be events. Find an expression and exhibit the Venn diagram for the event that 

(i) A or not B occurs, (ii) neither A nor B occurs. 

3.26. Let A, B and C be events. Find an expression and exhibit the Venn diagram for the event that 
(i) exactly one of the three events occurs, (ii) at least two of the events occurs, (iii) none of 
the events occurs, (iv) A or B, but not C , occurs. 

3.27. Let a penny, a dime and a die be tossed. 

(i) Describe a suitable sample space S. 

(ii) Express explicitly the following events: A — {two heads and an even number appear}, 
B — (a 2 appears}, C — {exactly one head and a prime number appear}. 

(iii) Express explicitly the event that (a) A and B occur, ( 6 ) only B occurs, ( c) B or C occurs. 


FINITE PROBABILITY SPACES 

3.28. Which function defines a probability space on S = {a u a 2 , a 3 }? 

(i) P(a i) = P(a 2 ) = L P(o 8 ) = | (iii) P(a 1 ) = ± P(a 2 ) = ± P(a 8 ) = * 

(ii) P(a ± ) = f, P(a 2 ) - —i P(a 3 ) = f (iv) P(a t ) = 0, P(a 2 ) = ± P(a z ) = § 

3.29. Let P be a probability function on S = {a v a 2 , a 3 }. Find P{a x ) if (i) P(a 2 ) = J and P(a z ) = 

(ii) P(a x ) - 2 P(a 2 ) and P{a z ) = L (iii) P({a 2 ,a z }) = 2P(a 1 ) f (iv) P(a z ) = 2 P(a 2 ) and 

P(a 2 ) — 3P(a 1 ). 

3.30. A coin is weighted so that heads is three times as likely to appear as tails. Find P(H) and P(T). 

3.31. Three students A , B and C are in a swimming race. A and B have the same probability of winning 
and each is twice as likely to win as C. Find the probability that B or C wins. 

3.32. A die is weighted so that the even numbers have the same chance of appearing, the odd numbers 
have the same chance of appearing, and each even number is twice as likely to appear as any 
odd number. Find the probability that (i) an even number appears, (ii) a prime number appears, 

(iii) an odd number appears, (iv) an odd prime number appears. 

3.33. Find the probability of an event if the odds that it will occur are (i) 2 to 1, (ii) 5 to 11. 

3.34. In a swimming race, the odds that A will win are 2 to 3 and the odds that B will win are 1 to 4. 

Find the probability p and the odds that A or B wins the race. 


FINITE EQUIPROBABLE SPACES 

3.35. A class contains 5 freshmen, 4 sophomores, 8 juniors and 3 seniors. A student is chosen at random 
to represent the class. Find the probability that the student is (i) a sophomore, (ii) a senior, 
(iii) a junior or senior. 

3.36. One card is selected at random from 50 cards numbered 1 to 50. Find the probability that the 

number on the card is (i) divisible by 5, (ii) prime, (iii) ends in the digit 2. 

3.37. Of 10 girls in a class, 3 have blue eyes. If two of the girls are chosen at random, what is the 

probability that (i) both have blue eyes, (ii) neither has blue eyes, (iii) at least one has blue eyes? 

3.38. Three bolts and three nuts are put in a box. If two parts are chosen at random, find the 
probability that one is a bolt and one a nut. 
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3.39. Ten students, A,B, ..., are in a class. If a committee of 3 is chosen at random from the class, 

find the probability that (i) A belongs to the committee, (ii) B belongs to the committee, 

(iii) A and B belong to the committee, (iv) A or B belongs to the committee. 

3.40. A class consists of 6 girls and 10 boys. If a committee of 3 is chosen at random from the class, 

find the probability that (i) 3 boys are selected, (ii) exactly 2 boys are selected, (iii) at least one 

boy is selected, (iv) exactly 2 girls are selected. 

3.41. A pair of fair dice is tossed. Find the probability that the maximum of the two numbers is 
greater than 4. 

3.42. Of 120 students, 60 are studying French, 50 are studying Spanish, and 20 are studying French 
and Spanish. If a student is chosen at random, find the probability that the student (i) is studying 
French or Spanish, (ii) is studying neither French nor Spanish. 

3.43. Three boys and 3 girls sit in a row. Find the probability that (i) the 3 girls sit together, 
(ii) the boys and girls sit in alternate seats. 


NONCOUNTABLE UNIFORM SPACES 

3.44. A point is selected at random inside an equilateral triangle whose side length is 3. Find the 
probability that its distance to any corner is greater than 1. 

3.45. A coin of diameter ^ is tossed randomly onto the Cartesian plane R 2 . Find the probability that the 
coin does not intersect any line whose equation is of the form (a) x — k, (b) x + V = k, (c) x = k 
or y ~ k. (Here k is an integer.) 


3.46. A point X is selected at random from a line segment AB with midpoint O. Find the probability that 
the line segments AX, XB and AO can form a triangle. 


MISCELLANEOUS PROBLEMS 

3.47. Let A and B be events with P(AuB) = ^ , P(AnB) = ^ and P(A C ) = Find P(A), P(B) and 
P(An£c). 

3.48. Let A and B be events with P(A) = 1, P(AuB) = £ and P(B C ) = ||. Find P(AnB), P(A c nB c ), 
P(A c uB c ) and P(BnA c ). 

3.49. A die is tossed 50 times. The following table gives the six numbers and their frequency of 
occurrence: 


Number 

1 

2 

3 

4 

5 

6 

Frequency 

7 

9 

8 

7 

9 

10 


Find the relative frequency of the event (i) a 4 appears, (ii) an odd number appears, (iii) a prime 
number appears. 

3.50. Prove: For any events A v A 2 , ..., A n , 

P(A 1 u---uA„) = 2P(A f ) - 2 PiAiHAj) + 2 P(A f nAjnA k ) - ± P(A,n• • • nA„) 

i i <3 i<j<k 

(Remark: This result generalizes Theorem 3.5 and Corollary 3.6.) 
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Answers to Supplementary Problems 

3.25. (i) AkjBc, (ii) (AuB) c 

3.26. (i) (AnB c nC c )u (BnA c nC c )u (CnA c nB c ) (iii) (AuBuC)' 

(ii) (AnB)u(AnOu(BnC) (iv) (AuB)nC' 

3.27. (i) S = (HH1, HH 2 , HH3, HH4, HH5, HH 6 , HT1, HT2, HT3, HT4, HT5, HT 6 , 

TH1, TH2, TH3, TH4, TH5, TH 6 , TT 1 , TT 2 , TT3, TT4, TT5, TT 6 } 

(ii) A = {HH2, HH 4 , HH 6 }, B = {HH 2 , HT2, TH2, TT 2 }, C = {HT 2 , TH2, HT3, TH3, HT5, TH5} 

(iii) (a) AnB = {HH2} 

(b) B\(AuC) = {TT2} 

(c) BuC = {HH2, HT2, TH2, TT2, HT3, TH3, HT5, TH5) 

3.28. (i) no, (ii) no, (iii) yes, (iv) yes 

3.29. (i) (ii) $, (iii) 4> (iv) -jL 

3.30. P(H) = f, P(T) = i 

3.31. f 

3.32. (i) f, (ii) f, (iii) 4> (v) § 

3.33. (i) f, (ii) ^ 

3.34. p = f; the odds are 3 to 2. 

3.35. (i) gj (ii) 2 q» (iii) 20 

3.36. (i) (ii) (iii) ^ 

3.37. (i) (ii) js, (iii) ^ 

3.38. § 

3.39. (i) jjj, (ii) fo’ (>») & ( iv ) fi 

3.40. (i) (”) %’ ( m ) §> ( iv ) sf 

3.41. f 

3.42. (i) f, (ii) i 

3.43. (i)£, (ii) 

3.44. l-2^/(9V3) 

3.45. (i)l, (ii) 1 - $V2, (iii) 4 

3.46. i 

3.47. P(A) = f, P(B) = f, P(AnB c ) = 4 

3.48. P(AnB) = 4 , P(A c nR c ) = 4 , P(A°l>B') = 4 , P(BnA') = 4 

3.49. (i)^, (ii) f |1 (iii) 1 




Chapter 4 


Conditional Probability 
and Independence 


CONDITIONAL PROBABILITY 


Let E be an arbitrary event in a sample space S with P(E) > 0. The probability that 
an event A occurs once E has occurred or, in other words, the conditional probability of 
A given E, written P(A \ E), is defined as follows: 


P{A\E) = 


P(Ar\E) 

P(E) 


As seen in the adjoining Venn diagram, P(A \ E) in a 
certain sense measures the relative probability of A 
with respect to the reduced space E. 



In particular, if S is a finite equiprobable space and |A| denotes the number of elements 
in an event A, then 


P(Ar\E) = P(E) 

That is, 


m 

|S| 


and so 


P(A | E) 


P(AnE) __ \AnE\ 

P(E) ~ |ff| 


Theorem 4.1: 


Let S be a finite equiprobable space with events A and E. Then 


or 


P(A | E) = 


number of elements in A n E 
number of elements in E 


P(A | E) = 


number of ways A and E can occur 
number of ways E can occur 


Example 4.1: Let a pair of fair dice be tossed. If the sum is 6, find the probability that one of 
the dice is a 2. In other words, if 

E = {sum is 6} = {(1,5), (2,4), (3,3), (4,2), (5,1)} 
and A = {a 2 appears on at least one die} 

find P(A | E). 

Now E consists of five elements and two of them, (2,4) and (4,2), belong to A: 
AnE — {(2,4), (4,2)}. Then P(A\E)=\. 

On the other hand, since A consists of eleven elements, 

A = {(2,1), (2, 2), (2, 3), (2, 4), (2, 5), (2, 6), (1, 2), (3,2), (4, 2), (5, 2), (6,2)} 
and S consists of 36 elements, P(A ) = 

Example 4.2: A man visits a couple who have two children. One of the children, a boy, comes 
into the room. Find the probability p that the other is also a boy if (i) the other 
child is known to be younger, (ii) nothing is known about the other child. 

The sample space for the sex of two children is S = {66, bg, gb, gg} with 
probability ^ for each point. (Here the sequence of each point corresponds to the 
sequence of births.) 

(i) The reduced sample space consists of two elements, {66, bg}; hence p = 1. 

(ii) The reduced sample space consists of three elements, {66, bg, gb}; hence p = 
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MULTIPLICATION THEOREM FOR CONDITIONAL PROBABILITY 

If we cross multiply the above equation defining conditional probability and use the 
fact that AnE = EnA, we obtain the following useful formula. 

Theorem 4.2: P(E n A) = P{E) P(A \ E) 

This theorem can be extended by induction as follows: 

Corollary 4.3: For any events A h A 2 , .. .,A n , 

P(AiDA 2 n • * • nA„) 

= P(A 1 )P(A 2 \Ai)P(A 3 \A 1 nA 2 )“ •P(A n \A 1 nA 2 n • • • nAn-i) 

We now apply the above theorem which is called, appropriately, the multiplication 
theorem . 

Example 4.3: A lot contains 12 items of which 4 are defective. Three items are drawn at ran¬ 

dom from the lot one after the other. Find the probability p that all three are 
nondefective. 

The probability that the first item is nondefective is ~ since 8 of 12 items are 
nondefective. If the first item is nondefective, then the probability that the next 
item is nondefective is ~ since only 7 of the remaining 11 items are nondefective. 
If the first two items are nondefective, then the probability that the last item is 
nondefective is A since only 6 of the remaining 10 items are now nondefective. 
Thus by the multiplication theorem, 

8 7 6 _ 14 

P 12 ‘ 11 * 10 55 

FINITE STOCHASTIC PROCESSES AND TREE DIAGRAMS 

A (finite) sequence of experiments in which each experiment has a finite number of 
outcomes with given probabilities is called a ( finite ) stochastic process . A convenient 
way of describing such a process and computing the probability of any event is by a tree 
diagram as illustrated below; the multiplication theorem of the previous section is used to 
compute the probability that the result represented by any given path of the tree does occur. 

Example 4.4: We are given three boxes as follows: 

Box I has 10 light bulbs of which 4 are defective. 

Box II has 6 light bulbs of which 1 is defective. 

Box III has 8 light bulbs of which 3 are defective. 

We select a box at random and then draw a bulb at random. What is the proba¬ 
bility p that the bulb is defective? 

Here we perform a sequence of two experiments: 

(i) select one of the three boxes; 

(ii) select a bulb which is either defective (. D ) or nondefective (N). 

The following tree diagram describes this process and gives the probability of each 
branch of the tree: 
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The probability that any particular path of the tree occurs is, by the multiplica¬ 
tion theorem, the product of the probabilities of each branch of the path, e.g., 
the probability of selecting box I and then a defective bulb is 

Now since there are three mutually exclusive paths which lead to a defective 
bulb, the sum of the probabilities of these paths is the required probability: 

_ 1 2,1 1,1 3_ 113 

P 3*5 + 3'6 + 3'8 360 


Example 4.5: A coin, weighted so that P(H) = § and P(T) = is tossed. If heads appears, 
then a number is selected at random from the numbers 1 through 9; if tails ap¬ 
pears, then a number is selected at random from the numbers 1 through 5. Find 
the probability p that an even number is selected. 

The tree diagram with respective probabilities is 



Note that the probability of selecting an even number from the numbers 1 
through 9 is ^ since there are 4 even numbers out of the 9 numbers, whereas the 
probability of selecting an even number from the numbers 1 through 5 is since 
there are 2 even numbers out of the 5 numbers. Two of the paths lead to an 
even number: HE and TE. Thus 


p = P( E) 


2 4,12 

3 " 9 + 3 ’ 5 
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PARTITIONS AND BAYES’ THEOREM 


Suppose the events Ai,A 2 , . . ., A„ form a partition 
of a sample space S; that is, the events A { are mutually 
exclusive and their union is S. Now let B be any other 
event. Then 

B = SnB = (AiUA 2 U • • • UA n ) n B 
= (A, n B) u (A 2 n P) u • • • u (A„ n P) 

where the A f nP are also mutually exclusive. Ac¬ 
cordingly, 

P(P) = P(A,nP) + P(A 2 nP) + ••• 

Thus by the multiplication theorem, 

P(B) = P(Ai) P(P | Ai) + P(A 2 )P(P|A 2 ) + ••• + P(A n ) P(B \A n ) (1) 

On the other hand, for any i, the conditional probability of Ai given B is defined by 



+ P(A n n B ) 


P(A< | B) 


P(AinP) 

P(B) 


In this equation we use ( 1 ) to replace P(B) and use P(AinP) = P(Ai)P(P|A,) to replace 
P(AinP), thus obtaining 


Bayes’ Theorem 4.4: 

P(Ai | B) = 


Suppose Ai, A 2 , ..., A n is a partition of S and B is any event, 
for any i, 

P(A t )P(B\Ai) 

P(Ai)P(B | Ai) + P(A 2 )P(P|A 2 ) + ••• +P(A„)P(P|A„) 


Then 
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Example 4.6: 


Three machines A, B and C produce respectively 50%, 30% and 20% of the total 
number of items of a factory. The percentages of defective output of these ma¬ 
chines are 3%, 4% and 5%. If an item is selected at random, find the probability 
that the item is defective. 


Let X be the event that an item is defective. 
Then by (1) above, 

P(X) = P(A) P(X | A) + P(B) P(X | B) 

+ p(C) P(X | o 

= (.50)(.03) + (.30) (.04) + (,20)(.05) 

= .037 

Observe that we can also consider this problem as 
a stochastic process having the adjoining tree 
diagram. 



Example 4.7: Consider the factory in the preceding example. Suppose an item is selected at 

random and is found to be defective. Find the probability that the item was pro¬ 
duced by machine A; that is, find P(A \ X ). 

By Bayes* theorem, 

_ _ P(A)P(X\A) _ 

^ 1 } P(A)P(X\A) 4- P(B)P(X\B) + P(C)P(X\C) 

_ (.50)(.03) _ _ 15 

(.50)(.03) + (.30)(.04) + (.20)(.05) 37 

In other words, we divide the probability of the required path by the probability 
of the reduced sample space, i.e. those paths which lead to a defective item. 


INDEPENDENCE 

An event B is said to be independent of an event A if the probability that B occurs is 
not influenced by whether A has or has not occurred. In other words, if the probability 
of B equals the conditional probility of B given A: P(B) = P(B\A). Now substituting 
P(B) for P(B\A) in the multiplications theorem P(AC\B) — P(A) P(B \A), we obtain 

P(AHB) = P(A)P(B) 

We use the above equation as our formal definition of independence. 

Definition: Events A and B are independent if P(AC\B) — P(A)P(B ); otherwise they 

are dependent. 

Example 4.8: Let a fair coin be tossed three times; we obtain the equiprobable space 

S = (HHH, HHT, HTH, HTT, THH, THT, TTH, TTT} 

Consider the events 

A — (first toss is heads}, B = (second toss is heads} 

C = (exactly two heads are tossed in a row} 

Clearly A and B are independent events; this fact is verified below. On the other 
hand, the relationship between A and C or B and C is not obvious. We claim 
that A and C are independent, but that B and C are dependent. We have 

P(A) = P({HHH, HHT, HTH, HTT}) = | = | 

P(B) = P({HHH, HHT, THH, THT}) = | = | 

P(C) = P({HHT, THH}) = f = \ 

Then 

P(AnP) = P({HHH, HHT}) = i P(AnC) = P({HHT}) = i, 

4 o 

P(B n C) = P({HHT, THH}) = ( 
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Accordingly, 

P(A) P{B) ~ \ — \ ~ P{AnB), and so A and B are independent; 

P(A) P(C) — \ ~ | — P(AnC), and so A and C are independent; 

P(B) P(C) = ~ • | = i # P(BnC), and so B and C are dependent. 

Frequently, we will postulate that two events are independent, or it will be clear from 
the nature of the experiment that two events are independent. 

Example 4.9: The probability that A hits a target is J and the probability that B hits it is §. 

What is the probability that the target will be hit if A and B each shoot at the 
target? 

We are given that P(A) = 1 and P(B) — f, and we seek P(AuB). Further¬ 
more, the probability that A or B hits the target is not influenced by what the 
other does; that is, the event that A hits the target is independent of the event 
that B hits the target: P(Ar\B) = P(A)P(B). Thus 

P(A U B) = P(A) 4- P(B) - P(AnB) = P(A) + P(B) - P(A) P(B) 

= i + 2.1,2 = 11 
4^ 5 4 5 20 

Three events A, B and C are independent if: 

(i) P(AHB) = P(A)P(B), P(AnC) = P(A)P(C) and P(BnC) = P(B)P{C) 
i.e. if the events are pairwise independent, and 

(ii) P(AnBDC) = P(A)P(B)P(C ). 

The next example shows that condition (ii) does not follow from condition (i); in other 
words, three events may be pairwise independent but not independent themselves. 

Example 4.10: Let a pair of fair coins be tossed; here S = {HH, HT, TH, TT} is an equiprobable 
space. Consider the events 

A = {heads on the first coin} = (HH, HT} 

B = {heads on the second coin} = {HH, TH} 

C — {heads on exactly one coin} = {HT, TH} 

Then P(A) = P(B) = P(C) = § = | and 

P{AnB) = P({HH}) = P(A n C) = P({HT}) = P(BnC) = ({TH}) = | 

Thus condition (i) is satisfied, i.e., the events are pairwise independent. However, 
AnBnC = 0 and so 

P(AnBnC) - P(0) = 0 # P(A)P(B)P(C) 

In other words, condition (ii) is not satisfied and so the three events are not inde¬ 
pendent. 


INDEPENDENT OR REPEATED TRIALS 

We have previously discussed probability spaces which were associated with an experi¬ 
ment repeated a finite number of times, as the tossing of a coin three times. This concept 
of repetition is formalized as follows: 

Definition: Let S be a finite probability space. By n independent or repeated trials , we 

mean the probability space T consisting of ordered w-tuples of elements of S 
with the probability of an n-tuple defined to be the product of the probabilities 
of its components: 

P((Sl, S 2 , . . . , S n )) = P(Si) P(S 2 ) * * * P{Sn) 
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Example 4.11: Whenever three horses a, b and e race together, their respective probabilities of 
winning are £ and In other words, S = {a, b, c} with P(a) = •>-, P(b) = l 

and P(c) - £. If the horses race twice, then the sample space of the* 2 repeated 
trials is 

T — {aa, ab, ac , ba, bb, be, ca, cb } cc} 

For notational convenience, we have written ac for the ordered pair (a c) The 
probability of each point in T is 

P(aa) = P(a) P(a) = = \ P( ba) = | P{ea) = i. 

P(ab) = P(a) P(b) =|*|=| P(bb) = | P(cb) = ^ 

P{ac) = P(a) P(c) = 1.1 = ± P(bc) = ^ P(ec) = ± 

Thus the probability of c winning the first race and a winning the second race is 

p ( c «) = -A- 

From another point of view, a repeated 
trials process is a stochastic process whose 
tree diagram has the following properties: 

(i) every branch point has the same outcomes; 

(ii) the probability is the same for each 
branch leading to the same outcome. For 
example, the tree diagram of the repeated 
trials process of the preceding experiment 
is as shown in the adjoining figure. 

Observe that every branch point has the 
outcomes a, b and c, and each branch lead¬ 
ing to outcome a has probability each 

branch leading to b has probability and 
each leading to c has probability £. 



Solved Problems 

CONDITIONAL PROBABILITY IN FINITE EQUIPROBABLE SPACES 

4.1. A pair of fair dice is thrown. Find the probability p that the sum is 10 or greater if 
(i) a 5 appears on the first die, (ii) a 5 appears on at least one of the dice. 

(i) If a 5 appears on the first die, then the reduced sample space is 

A = {(5,1), (5,2), (5, 3), (5,4), (5, 5), (5, 6)} 

The sum is 10 or greater on two of the six outcomes: (5, 5), (5, 6). Hence p = 1 = 1 . 

(ii) If a 5 appears on at least one of the dice, then the reduced sample space has eleven elements: 

B = {(5,1), (5,2), (5, 3), (5,4), (5, 5), (5, 6), (1,5), (2, 5), (3, 5), (4, 5), (6, 5)} 

The sum is 10 or greater on three of the eleven outcomes: (5,5), (5,6), (6,5). Hence p = 
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4.2. Three fair coins are tossed. Find the probability p that they are all heads if (i) the 
first coin is heads, (ii) one of the coins is heads. 

The sample space has eight elements: S = {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT}. 

(i) If the first coin is heads, the reduced sample space is A = {HHH, HHT, HTH, HTT}. Since 
the coins are all heads in 1 of 4 cases, p = J. 

(ii) If one of the coins is heads, the reduced sample space is B ~ (HHH, HHT, HTH, HTT, THH, 
THT, TTH}. Since the coins are all heads in 1 of 7 cases, p = 1. 

4.3. A pair of fair dice is thrown. If the two numbers appearing are different, find the 
probability p that (i) the sum is six, (ii) an ace appears, (iii) the sum is 4 or less. 

Of the 36 ways the pair of dice can be thrown, 6 will contain the same numbers: (1,1), (2,2), 
.. ( 6 , 6 ). Thus the reduced sample space will consist of 36 — 6 = 30 elements. 

(i) The sum 6 can appear in 4 ways: (1, 5), (2,4), (4, 2 ), (5, 1 ). (We cannot include ( 3 , 3 ) since 
the numbers are the same.) Hence p — 

(ii) An ace can appear in 10 ways: ( 1 , 2 ), (1,3), ..., ( 1 , 6 ) and (2,1), ( 3 , 1 ), ..., (6,1). Hence 

^ 30 ~ 3 * 

(iii) The sum of 4 or less can occur in 4 ways: (3, 1 ), ( 1 , 3 ), ( 2 , 1 ), (1,2). Thus p = ^ 

4.4. Two digits are selected at random from the digits 1 through 9. If the sum is even, 
find the probability p that both numbers are odd. 

The sum is even if both numbers are even or if both numbers are odd. There are 4 even 
numbers (2,4, 6 , 8 ); hence there are (J) = 6 ways to choose two even numbers. There are 5 odd 
numbers (1,3, 5, 7,9); hence there are (®) — 10 ways to choose two odd numbers. Thus there are 
6 + 10 = 16 ways to choose two numbers such that their sum is even; since 10 of these ways occur 
when both numbers are odd, p = ^ = §. 


4.5. A man is dealt 4 spade cards from an ordinary deck of 52 cards. If he is given 
three more cards, find the probability p that at least one of the additional cards is 
also a spade. 

Since he is dealt 4 spades, there are 52 — 4 = 48 cards remaining of which 13 — 4 = 9 are 
spades. There are ( 4 g ) = 17,296 ways in which he can be dealt three more cards. Since there are 
48 — 9 = 39 cards which are not spades, there are ( 3 g 9 ) = 9139 ways he can be dealt three cards 
which are not spades. Thus the probability q that he is not dealt another spade is q — ; 

hence p = l -q = 


4.6. Four people, called North, South, East and West, are each dealt 13 cards from an 

ordinary deck of 52 cards. 

(i) If South has no aces, find the probability p that his partner North has exactly 
two aces. 

(ii) If North and South together have nine hearts, find the probability p that East 
and West each has two hearts. 

(i) There are 39 cards, including 4 aces, divided among North, East and West. There are 
(j 9 ) ways that North can be dealt 13 of the 39 cards. There are (g) ways he can be dealt 2 of 
the four aces, and ( 3 *) ways he can be dealt 11 cards from the 39 — 4 = 35 cards which are 
not aces. Thus g5 

_ ( 2 )(n) __ 6-12*13-25-26 __ 650 

V ( 39 \ 36- 37* 38* 39 2109 




CHAP. 4] 


CONDITIONAL PROBABILITY AND INDEPENDENCE 


tw p cards, including 4 hearts, divided among East and West. There are ( 26 ) wavs 

that East can be dealt 13 cards. (We need only analyse East’s 13 cards s nee w"s mus 

have the remaining cards.) There are (<) ways East can be dealt 2 hearts from 4 hearts 
( n ) ways he can be dealt 11 non-hearts from the 26 - 4 = 22 non-hearts. Thus 

p - _ 6 *12 *13-12-13 _ 234 

(is) 23 • 24 • 25 • 26 “ 675 

multiplication theorem 

4 ' 7 ' at ^- 

i6 Z 2S Sft. 1 “tt d '* * T* 12/15 s,nce ,im * re 12 b «” -*« 

since there ar, U boys left out of 16 studeltte FinolH if f' “t TT' * 102 ls 11/15 

then the probability that the third student is a bov^Tn/l^* tw ° students selected were boys, 

14 students. Thus, by the multiplication theorem, the probability tha/aH ^hreTare^boyl 

p = 12.11.10 _ 11 

16 15 14 “ 28 

,he 16 " d 

to select three stu^'and ll^T^lJ wa^l^seJeTt twtoysfheJe'p Sm - h® ‘ 14 WayS 

9 ' V — 16 . 15.14 “ 28 * 

“ t z r Jr an ° rdinary deck ° f 62 -*■ wh “‘ 

::rr7^ 

ach case that the previous cards were spades,) Thus v = — . £ • . 12 . ± __ 33 

52 51 50 4 9 4 8 66,640 * 

4 ' 9 ' the ZZeZlr the ^ ft™ 68 ' Three marbles from 

the third is white. ' d the pr ' lbablllt >' P that the first two are red and 

m.r£ rVb-?: io ^“ ,re 7 ™ i »* »<« 

«»r, 6 riK , tttfSSLu'Z.’Zfr** S " d “ M 

the probability that the third marble is white is 3/8 since first tw ° marbles are red, then 

8 marbles in the urn. Hence by the multiplication theorem ““ marbleS ° Ut of the 

„ - 7 6 3 7 

10 9 8 40 

4 ' 10 * I*! a Cl ^u;T Sel6Cted at random ’ one after the other, for an examina- 

. md the probability p that the boys and girls in the class alternate if a\ +B 
class consists of 4 boys and 3 girls, (ii) the class^onsLts of fboyf S 3 girls! } 

r f ,o l b.’b5 S ,r.J .t r '«",'l“.‘w';f4 , /, th ‘, n f ?b“ "," 1 , M ” d “* ““‘“ d »»* >» . boy. The 
second is a girl is 3/6 since there are 3 eirls 01 f-% a b ° y ’ then th . e P robabilit y that the 

we obtain the probability that the third „ i, ° o/r l J deid ' s left ‘ Continuing in this manner, 
boy ,o«,tho»iah “ 2/4 ’ ,te is * 


V 
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(ii) There are two mutually exclusive cases: the first pupil is a boy, and the first is a girl. If the 
first student is a boy, then by the multiplication theorem the probability p l that the students 

alternat6iS _ 3 3 2 2 1 1 _ X 

Pl 6 * 5 ' 4 ’ 3 * 2 * 1 20 

If the first student is a girl, then by the multiplication theorem the probability p 2 that 
the students alternate is 

_ 3 _ 3 _ 2 _ 2 1 1 _ J_ 

P2- 6*5'4'3*2*l 20 


Thus p = p 1 + p 2 


— + — 
20 ' 20 


10 * 


MISCELLANEOUS PROBLEMS ON CONDITIONAL PROBABILITY 

4.11. In a certain college, 25% of the students failed mathematics, 15% of the students 
failed chemistry, and 10% of the students failed both mathematics and chemistry. 
A student is selected at random. 

(i) If he failed chemistry, what is the probability that he failed mathematics? 

(ii) If he failed mathematics, what is the probability that he failed chemistry? 

(iii) What is the probability that he failed mathematics or chemistry ? 

Let M = (students who failed mathematics) and C = (students who failed chemistry); then 
P(M) = .25, P(C) = .15, P(MnC) = .10 


(i) 


The probability that a student failed mathematics, given that he has failed chemistry is 


P(M | C) 


P(M n C) 
P(C) 


JU) 2 

.15 3 


(ii) The probability that a student failed chemistry, given that he has failed mathematics is 

P(clM) = PjCnM) 40 _ 2 

/ I M > p( M ) .25 5 

(iii) P(MuC) = P(M) + P{C) — P(M n C) = .25 + .15 -.10 = .30 = ^ 


4.12. 


Let A and B be events with P{A) = P(B) — i and P(AnB) — J. 
(ii) P(B | A), (iii) P(AuP), (iv) P(A C | P c ), (v)P(P c |A c ). 


(i) P(A | B) 


P(AnB) 

P(B) 



(ii) P(B\A) 


P(BnA) 

P(A) 


Find (i) P(A | B), 


1 

2 


(iii) P(AuB) = P(A) + P(B) - P(AnB) = f+f-J 


_ 7 _ 

12 


(iv) 


(v) 


First compute P(B C ) and P(A c nB c ). P(B C ) — 1 P(B) — 1 i — 
law, (Au£)= = AcnP c ; hence P(A c nB c ) = P((AuB) c ) = 1 — P(AuB) = 


Thus P(A C | B c ) = 


P(AcnP c ) 

P(P C ) 


5 

1 2 
2 
a 


5 

8’ 


P{A c) = 1 - P(A) =1-1 


= Then P(B<> \ A c ) = 


P(B c nA c ) 

P(A<) 



By De Morgan’s 

_ 7 — 5 

12 12 * 


4.13. Let A and B be events with P(A) = f, P(B ) = f and P(AUB) — f. Find P(A \ B) 
and P(B\A). 

First compute P(AnB) using the formula P(AuB) = T*(A) + P{B) — P(AnB): 

1 


j = hl~ P(AnB) 01 


P(A n B) = 


Then P(A\B)=^^= | = | and P(B | A) = = | = * 


3* 
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4.14. Find P(B | A) if (i) A is a subset of B, (ii) A and B are mutually exclusive. 

A } s 3 subset of B, then whenever A occurs B must occur; hence P(B I A) = 1 Alternately 
if A is a subset of B then A nB = A ; hence y ’ 


P(B | A) 


P(A nB) P(A) 

P(A) P(A) ~ 



(ll) p/7? 1“ UtUa '^ e ^lusive, i.e. disjoint, then whenever A occurs B cannot occur; hence 
P(U\A) — 0. Alternately, if A and B are mutually exclusive then AnB = 0; hence 


P(B | A) 


_ P(A n B) _ P(0) 


0 


P(A) 


P(A) P(A) 


= 0 


4.15. 


Three machines A, B and C produce respectively 60%, 30% and 10% of the total 
number of items of a factory. The percentages of defective output of these machines 
are respectively 2%, 3% and 4%. An item is selected at random and is found defective. 
Find the probability that the item was produced by machine C. 


„ Let 7 i defective items >- We seek P(C | X), the probability that an item is produced 
C given that the item is defective. By Bayes’ theorem, 


by machine 


P(C \X) 


_ P(C) P(X I C) 

P(A)P(Y|A) + P(P)P(Y|.B) + P(C)P(Y|(7) 

(•10K.04) _ _ 

(.60)(.02) + (,30)(.03) + (,10)(.04) 25 


4.16. 


In a certain college, 4% of the men and 1% of the women are taller than 6 feet. 
Furthermore, 60% of the students are women. Now if a student is selected at 
random and is taller than 6 feet, what is the probability that the student is a woman? 


Let A = {students taller than 6 feet}. We seek P( W | A), the probability that 
woman given that the student is taller than 6 feet. By Bayes’ theorem, 


a student is a 


P(W | A) 


P(W)P(A I TP) 

P(W) P(A | TP) + P(M) P(A | W) 


(.60H.01) _ _3_ 

(.60) (.01) + (,40)(.04) 11 


4A7. Let E 'be an event for which P(E) > 0. Show that the conditional probability function 
P(*\E) satisfies the axioms of a probability space; that is, 

[Pi] For any event A, 0 =£ P(A | E) ^ 1. 

[P2] For the certain event S, P(S j E) = 1. 

[P 3 ] If A and B are mutually exclusive, then P(AUB | E) = P(A \ E) + P(B \ E). 

[P4] If Ai, A 2 , ... is a sequence of mutually exclusive events, then 
P(AiUA 2 u ••• \E) = P{Ai | E) + P(A21 E) + ... 


(i) 


We have A nE c E; hence P(A n E) =£ P(E). Thus P(A I E) = f>(AnE) == 1 
negative. That is, 0 ^ P(A\E) ^ l and so [PJ holds. P(B) 


and is also non- 
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(ii) We have SnE — E; hence P(S | E) = P %2,f~ = = 1. Thus [P 2 ] holds. 

r\E*) 


(iii) If A and B are mutually exclusive events, then so are Ar\E and Bc\E. Furthermore, 
{A\JB)nE — (An 2?) U {BnE). Thus 


and therefore 


P((AuB)nE) = P((AnE) u (BnE)) = P(AnE) + P(BnE) 


P(AuB\E) 


P((AuB)nE) 

P(E) 


P(AnE) + P(BnE) 
P(E) 


Hence [P 3 ] holds. 


P(AnE) P(BnE) 
P(E) + P(E) 


P(A | E) + P(B | E) 


(iv) 


Similarly if A v A 2 , ... are mutually exclusive, then so are AiHE, A 2 nE, ... . Thus 


PfCAjUAjU'-Ontf) = p^A^E) u(A 2 r\E)u •••) = P(A t nE) + P(A 2 nE) + • • • 


and therefore 

PiA^AzU--- | E) 


That is, [P 4 ] holds. 


P((A x uA 2 u • • •) n E) _ P(A x nE) + P(A 2 nE) + • ■ • 
P{E) ~ P(E) 


PiA^E) ( P(A 2 nE) 
P(E) + P(E) 


P(A 1 \E) + P(A 2 \E) + ••• 


FINITE STOCHASTIC PROCESSES 

4.18. A box contains three coins; one coin is fair, one coin is two-headed, and one coin is 
weighted so that the probability of heads appearing is £. A coin is selected at 
random and tossed. Find the probability p that heads appears. 


Construct the tree diagram as shown in Figure (a) below. Note that I refers to the fair coin, 
II to the two-headed coin, and III to the weighted coin. Now heads appears along three of the 
paths; hence 

P = 


1 1 . 1 1 , 1 1 11 

— • — -f- —- • 1 4- — • — — - 

32 3 33 18 




4.19. We are given three urns as follows: 

Urn A contains 3 red and 5 white marbles. 

Urn B contains 2 red and 1 white marble. 

Urn C contains 2 red and 3 white marbles. 

An urn is selected at random and a marble is drawn from the urn. If the marble 
is red, what is the probability that it came from urn A ? 

Construct the tree diagram as shown in Figure (b) above. 

We seek the probability that A was selected, given that the marble is red; that is, P(A | R ). 
In order to find P(A \ R), it is necessary first to compute P(AnR) and P(R)- 

The probability that urn A is selected and a red marble drawn is i * f = jl that is, 
P(AnR) =|. Since there are three paths leading to a red marble, P(R) = | + |+ | # f = Mo* 
Thus 
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P(A | R) 


Alternately, by Bayes' theorem, 


P(AnR) 

P(R) 


2 _ 

8 

173 

360 


45 

173 


P(A | R) 


_ P(A)P(R\A) _ 

P(A) P(R | A) + P(B) P(R | B) + P(C) P(R | C) 

_ill_ _ _46_ 

**§ + i*i + i*t 173 


4.20. 


Box A contains nine cards numbered 1 through 9, and box B contains five cards 
numbered 1 through 5. A box is chosen at random and a card drawn. If the number 
is even, find the probability that the card came from box A. 


The tree diagram of the process is shown in Figure (a) below. 

We seek P(A \ E), the probability that A was selected, given that the number is even. The 
probability that box A and an even number is drawn is = that is, P(AC\E) = |. Since 
there are two paths which lead to an even number, P(E) = + = Thus 


P(A | E) 


2 

P(AnE) _ j)_ _ io 
P(E) ~ li “ T9 



4.21, An urn contains 3 red marbles and 7 white marbles. A marble is drawn from the 
urn and a marble of the other color is then put into the urn. A second marble is 
drawn from the urn. 

(i) Find the probability p that the second marble is red. 

(ii) If both marbles were of the same color, what is the probability p that they were 
both white? 

Construct the tree diagram as shown in Figure (6) above. 

(i) Two paths of the tree lead to a red marble: p = 

(ii) The probability that both marbles were white is The probability that both 

marbles were of the same color, i.e. the probability of the reduced sample space, is 
10 * 10 + Io * 10 = 25 * Hence the conditional probability p = §£/|§ = 


4,22. We are given two urns as follows: 

Urn A contains 3 red and 2 white marbles. 

Urn B contains 2 red and 5 white marbles. 

An urn is selected at random; a marble is drawn and put into the other urn; then 
a marble is drawn from the second urn. Find the probability p that both marbles 
drawn are of the same color. 

Construct the following tree diagram: 
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Note that if urn A is selected and a red marble drawn and put into urn B, then urn B has 
3 red marbles and 5 white marbles. 

Since there are four paths which lead to two marbles of the same color, 

1 3 3 , 1 23,12 2,1 51 _ 901 

P 2 * 5*8 2*5*4 2*7*3 2*7*2 1680 


INDEPENDENCE 

4.23. Let A = event that a family has children of both sexes, and let B = event that a 
family has at most one boy. (i) Show that A and B are independent events if a 
family has three children, (ii) Show that A and B are dependent events if a family 
has two children. 

(i) We have the equiprobable space S = {bbb, bbg, bgb, bgg, gbb, gbg, ggb, ggg}. Here 

6 3 

A = {bbg, bgb , bgg, gbb, gbg, ggb} and so P(A) ~ g ” 4 

4 1 

B = {bgg, gbg, ggb, ggg} and so P(B) = - = ~ 

3 

AnB = {bgg, gbg, ggb} and so P(AnB) = g 

Since P(A) P(B) = £•-! = <§• = P(AnB), A and B are independent. 

(ii) We have the equiprobable space 5 = {66, bg, gb, gg}. Here 

A — {bg, gb} and so P(A) = | 

B = {bg, gb, gg} and so P(B) — | 

AnB = {bg, gb} and so P(AnB) = ~ 

Since P(A) P{B) ¥* P(AnB) t A and B are dependent. 


4.24. Prove: If A and B are independent events, then A c and B c are independent events. 

P(AcnB') = P((AuB)c) = 1 - P(AuB) = 1 - P(A) - P(B) + P(AnB) 

= 1 - P(A) - P(B) A P(A) P(B) = [1 - P(A)][ 1 - P(B)] = P(A°) P{B<) 

4.25. The probability that a man will live 10 more years is i, and the probability that his 
wife will live 10 more years is i. Find the probability that (i) both will be alive in 
10 years, (ii) at least one will be alive in 10 years, (iii) neither will be alive in 
10 years, (iv) only the wife will be alive in 10 years. 

Let A = event that the man is alive in 10 years, and B — event that his wife is alive in 
10 years; then P(A) — £ and P(B) = -J. 

(i) We seek P(AnB). Since A and B are independent, P(AnB) = P(A) P(J5) — — tz- 
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(ii) We seek P(AuB). P(AuB) = P(A) + P(B) - P(A nB) = £ + £ — Jg = £. 

(iii) We seek P(A'r\B'). Now P(A«) = 1 - P(A) = 1 - £ = 4 and P(£<0 = 1 - P(B) = 1 - £ = |- 
Furthermore, since A c and P c are independent, P(A C C\B C ) = P(A C ) P(B C ) = 4 * 4 = £• 

Alternately, since (A uP) c = A c nP c , P(A c nP c ) = P((AuB) c ) — 1 — P(AUP) = 1 — £ = £• 

(iv) We seek P(A c nP). Since P(A C ) = 1 — P(A) = 4 and A c and B are independent (see Problem 
4.56), P(A c nB) — P(A C ) P(B) = £. 


4.26. Box A contains 8 items of which 3 are defective, and box B contains 5 items of which 
2 are defective. An item is drawn at. random from each box. 

(i) What is the probability p that both items are nondefective ? 

(ii) What is the probability p that one item is defective and one not? 

(iii) If one item is defective and one is not, what is the probability p that the defec¬ 
tive item came from box A ? 

(i) The probability of choosing a nondefective item from A is I and from B is I. Since the 

5 3 3 ° ° 

events are independent, P = s "5 ~IT* 

(ii) Method 1. The probability of choosing two defective items is §•§ — Jo* From (i) the 
probability that both are nondefective is f. Hence = 1 — | — 

Method 2. The probability p x of choosing a defective item from A and a nondefective item 
from B is = The probability p 2 of choosing a nondefective item from A and a 

o o 5 2 1 9 1 19 

defective item from B is 3*5 = 4 . Hence p — Pi + p 2 — + 4 — 


(iii) Consider the events X = {defective item from A} and Y = {one item is defective and one 
nondefective}. We seek P(X | Y). By (ii), P(XnY) = p x — ~ and P(Y) = ~|. Hence 


V = P(X\Y) 


P(X n Y) & _ 9 

P(Y) ~ 12 19 

V 40 


4.27. The probabilities that three men hit a target are respectively l and Each 
shoots once at the target, (i) Find the probability p that exactly one of them hits 
the target, (ii) If only one hit the target, what is the probability that it was the 
first man ? 

Consider the events A = {first man hits the target}, B = {second man hits the target}, and 
C = {third man hits the target}; then P(A) = 1, P(B) = J and P(C) = The three events are 
independent, and P(A C ) = P(B C ) = £, P(C c ) = §. 

(i) Let E = {exactly one man hits the target}. Then 

E = (AnB c nC c )u (A c nBnC c )u (A c r\B c nC) 

In other words, if only one hit the target, then it was either only the first man, A D B c n C c , 

or only the second man, A c nBnC c , or only the third man, A c nB c nC. Since the three events 

are mutually exclusive, we obtain (using Problem 4.62) 

p = P(E) = P{Ar\B c nC c ) + P(A c nBn&) + P(A c nB c nC) 

~ P(A) P(B C ) P{C c ) + P(Ac) P{B) P(C c ) + P(A*) P(B C ) P(C) 

1 3 2 5 # 1 # 2 5 # 3 1 i , JL + JL - §1 

“ 6*4*3 6*4*3 ' 6*4*3 12 36 24 72 


(ii) 


We seek P(A \ E), the probability that the first man hit the target given that only one man 
hit the target. Now AnE = AnB c nC c is the event that only the first man hit the target. 
By (i), P(AnE) = P{AnB c nC c ) = ^ and P(E) = f|; hence 


P(A | E) 


P(AnE) 

P(E) 


12 _ _ 6 _ 
— 31 

72 
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INDEPENDENT TRIALS 

4.28. A certain type of missile hits its target with probability .3. How many missiles 
should be fired so that there is at least an 80% probability of hitting a target? 

The probability of a missile missing its target is ,7; hence the probability that n missiles miss a 
target is (.7)”. Thus we seek the smallest n for which 

1 — (.7) n > .8 or equivalently (,7) n < .2 

Compute: (.7) 1 = .7, (.7) 2 = .49, (.7) 3 = .343, (.7) 4 - .2401, (.7) 5 = .16807. Thus at least 5 missiles 
should be fired. 


4.29. A certain soccer team wins (W) with probability .6, loses (L) with probability .3 
and ties (T) with probability .1. The team plays three games over the weekend, 
(i) Determine the elements of the event A that the team wins at least twice and doesn’t 
lose; and find P(A). (ii) Determine the elements of the event B that the team wins, 
loses and ties; and find P(B). 

(i) A consists of all ordered triples with at least 2 W’s and no L’s. Thus 

A = (WWW, WWT, WTW, TWW} 

Furthermore, P(A) = P( WWW) + P(WWT) + P(WTW) + P(TWW) 

= (*6)(.6)(.6) + (.6)(.6)(.l) + (.6)(.1)(.6) + (.1)(.6)(.6) 

= .216 + .036 4- .036 + .036 = .324 

(ii) Here B = (WLT, WTL, LWT, LTW, TWL, TLW}. Since each element of B has probability 
(.6)(.3)(.l) = .018, P(B) = 6(.018) = .108. 


4.30. Let S be a finite probability space and let T be the probability space of n independent 
trials in S. Show that T is well defined; that is, show (i) the probability of each 
element of T is nonnegative and (ii) the sum of their probabilities is 1. 

If 5 — { ai> .. mj a r }, then T can be represented by 

T = (a- • -a in : i l9 .. ., i n = 1, .. .,r} 

Since P(a t ) — 0, we have 

?(%'•'%) = P(%)***P(%) - 0 


for a typical element * • • a in in T, which proves (i) 


We prove (ii) by induction on n . It is obviously true for n = 1. Therefore we consider n > 1 
and assume (ii) has been proved for n — 1. Then 


2 P(<V«i) = . 2 *(«,>••*(«.) = . 2 P(ai)---P( ai ) 2 P(%) 

i v ...,i n = l 1 " *1. *„ = 1 1 * = l 1 ” 1 l n =1 


2 P(a .)■■■ 

H . 4 n-l = 1 




by the inductive hypothesis, which proves (ii) for n. 



P(«Y 


1 
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Supplementary Problems 


CONDITIONAL PROBABILITY 

4.31. A die is tossed. If the number is odd, what is the probability that it is prime? 

4.32. Three fair coins are tossed. If both heads and tails appear, determine the probability that exactly 
one head appears. 

4.33. A pair of dice is tossed. If the numbers appearing are different, find the probability that the sum 
is even. 

4.34. A man is dealt 5 red cards from an ordinary deck of 52 cards. What is the probability that they 
are all of the same suit, i.e. hearts or diamonds? 

4.35. A man is dealt 3 spade cards from an ordinary deck of 52 cards. If he is given four more cards, 
determine the probability that at least two of the additional cards are also spades. 

4.36. Two different digits are selected at random from the digits 1 through 9. 

(i) If the sum is odd, what is the probability that 2 is one of the numbers selected? 

(ii) If 2 is one of the digits selected, what is the probability that the sum is odd? 

4.37. Four persons, called North, South, East and West, are each dealt 13 cards from an ordinary deck 
of 52 cards. 

(i) If South has exactly one ace, what is the probability that his partner North has the other 
three aces? 

(ii) If North and South together have 10 hearts, what is the probability that either East or 
West has the other 3 hearts? 

4.38. A class has 10 boys and 5 girls. Three students are selected from the class at random, one after 
the other. Find the probability that (i) the first two are boys and the third is a girl, (ii) the first 
and third are boys and the second is a girl, (iii) the first and third are of the same sex, and the 
second is of the opposite sex. 

4.39. In the preceding problem, if the first and third students selected are of the same sex and the second 
student is of the opposite sex, what is the probability that the second student is a girl? 

4.40. In a certain town, 40% of the people have brown hair, 25% have brown eyes, and 15% have both 
brown hair and brown eyes. A person is selected at random from the town. 

(i) If he has brown hair, what is the probability that he also has brown eyes? 

(ii) If he has brown eyes, what is the probability that he does not have brown hair? 

(iii) What is the probability that he has neither brown hair nor brown eyes? 

4.41. Let A and B be events with P{A) — P(B) = £ and P(A \JB) — -J. Find (i) P(A \ B), 

(ii) P(B \ A), (iii) P(AnB c ), (iv) P(A | B *). 

4.42. Let S = {a, 6, c, d, e,/} with P{a) = P(b) = P(c) = P(d) = 3^-, P(e) = £ and 

P(f) - JL. Let A = {a, c, e}, B = {c,d,e,/} and C = {b,c,f}. Find (i) P(A \ B), (ii) P(B | C), 

(iii) P{C | Ac), (iv) P(A C | C). 

4.43. In a certain college, 25% of the boys and 10% of the girls are studying mathematics. The girls 

constitute 60% of the student body. If a student is selected at random and is studying mathematics, 

determine the probability that the student is a girl. 
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FINITE STOCHASTIC PROCESSES 

4.44. We are given two urns as follows: 

Urn A contains 5 red marbles, 3 white marbles and 8 blue marbles. 

Urn B contains 3 red marbles and 5 white marbles. 

A fair die is tossed; if 3 or 6 appears, a marble is chosen from B t otherwise a marble is chosen 
from A. Find the probability that (i) a red marble is chosen, (ii) a white marble is chosen, 
(iii) a blue marble is chosen. 

4.45. Refer to the preceding problem, (i) If a red marble is chosen, what is the probability that it came 
from urn A? (ii) If a white marble is chosen, what is the probability that a 5 appeared on the die? 

4.46. An urn contains 5 red marbles and 3 white marbles. A marble is selected at random from the urn, 
discarded, and two marbles of the other color are put into the urn. A second marble is then selected 
from the urn. Find the probability that (i) the second marble is red, (ii) both marbles are of the 
same color. 

4.47. Refer to the preceding problem, (i) If the second marble is red, what is the probability that the 
first marble is red? (ii) If both marbles are of the same color, what is the probability that they 
are both white? 

4.48. A box contains three coins, two of them fair and one two-headed. A coin is selected at random 
and tossed twice. If heads appears both times, what is the probability that the coin is two-headed? 

4.49. We are given two urns as follows: 

Urn A contains 5 red marbles and 3 white marbles. 

Urn B contains 1 red marble and 2 white marbles. 

A fair die is tossed; if a 3 or 6 appears, a marble is drawn from B and put into A and then a marble 
is drawn from A; otherwise, a marble is drawn from A and put into B and then a marble is drawn 
from B. 

(i) What is the probability that both marbles are red? 

(ii) What is the probability that both marbles are white? 

4.50. Box A contains nine cards numbered 1 through 9, and box B contains five cards numbered 1 

through 5. A box is chosen at random and a card drawn; if the card shows an even number, 

another card is drawn from the same box; if the card shows an odd number, a card is drawn from 
the other box. 

(i) What is the probability that both cards show even numbers? 

(ii) If both cards show even numbers, what is the probability that they come from box A? 

(iii) What is the probability that both cards show odd numbers? 

4.51. A box contains a fair coin and a two-headed coin. A coin is selected at random and tossed. If 
heads appears, the other coin is tossed; if tails appears, the same coin is tossed. 

(i) Find the probability that heads appears on the second toss. 

(ii) If heads appeared on the second toss, find the probability that it also appeared on the first toss. 

4.52. A box contains three coins, two of them fair and one two-headed. A coin is selected at random 

and tossed. If heads appears the coin is tossed again; if tails appears, then another coin is selected 
from the two remaining coins and tossed. 

(i) Find the probability that heads appears twice. 

(ii) If the same coin is tossed twice, find the probability that it is the two-headed coin. 

(iii) Find the probability that tails appears twice. 

4.53. Urn A contains x red marbles and y white marbles, and urn B contains z red marbles and v white 
marbles. 

(i) If an urn is selected at random and a marble drawn, what is the probability that the marble 
is red? 

(ii) If a marble is drawn from urn A and put into urn B and then a marble is drawn from urn B , 
what is the probability that the second marble is red? 
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4.54. A box contains 5 radio tubes of which 2 are defective. The tubes are tested one after the other 
until the 2 defective tubes are discovered. What is the probability that the process stopped on the 

(i) second test, (ii) third test? 

4.55. Refer to the preceding problem. If the process stopped on the third test, what is the probability 
that the first tube is nondefective? 

INDEPENDENCE 

4.56. Prove: If A and B are independent, then A and B c are independent and A c and B are independent. 

4.57. Let A and B be events with P(A) = P(AuB) = £ and P(B) = p. (i) Find p if A and B are 
mutually exclusive, (ii) Find p if A and B are independent, (iii) Find p if A is a subset of B. 

4.58. Urn A contains 5 red marbles and 3 white marbles, and urn B contains 2 red marbles and 
6 white marbles. 

(i) If a marble is drawn from each urn, what is the probability that they are both of the same 
color? 

(ii) If two marbles are drawn from each urn, what is the probability that all four marbles are 
of the same color? 

4.59. Let three fair coins be tossed. Let A = {all heads or all tails}, B = (at least two heads} and 
C = (at most two heads}. Of the pairs (A,B), ( A,C) and (B, C), which are independent and 
which are dependent? 

4.60. The probability that A hits a target is £ and the probability that B hits a target is 

(i) If each fires twice, what is the probability that the target will be hit at least once? 

(ii) If each fires once and the target is hit only once, what is the probability that A hit the target? 

(iii) If A can fire only twice, how many times must B fire so that there is at least a 90% proba¬ 
bility that the target will be hit? 

4.61. Let A and B be independent events with P(A) = and P(AuB) = §. Find (i) P(B), (ii) P(A \ B) t 

(iii) P(B C | A). 

4.62. Suppose A, B, C are independent events. Show that any of the combinations 

A c , B, C; A,B c ,C ; ...; A c , B c , C; ...; A c ,B c ,C c 
are also independent. Furthermore, show that A and BuC are independent; and so forth. 


INDEPENDENT TRIALS 

4.63. A rifleman hits (H) his target with probability .4, and hence misses (M) with probability .6. He 
fires four times, (i) Determine the elements of the event A that the man hits the target exactly 
twice; and find P(A). (ii) Find the probability that the man hits the target at least once. 

4.64. A team wins (W) with probability .5, loses (L) with probability .3 and ties (T) with probability .2. 
The team plays twice, (i) Determine the sample space S and the probabilities of the elementary 
events, (ii) Find the probability that the team wins at least once. 

4.65. Consider a countably infinite probability space S = {a v a 2 , ...}. Let 

T = S n = {($!, s 2 , .. s n ) : s { eS} 
and let P( s i> s 2 > • • •> s n) = P( s i) P( s z) ’ ' * P( s n) 

Show that T is also a countably infinite probability space. (This generalizes the definition (page 58) 
of independent trials to a countably infinite space.) 
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Answers to Supplementary Problems 


4.31. | 

4.32. | 

4.33. § 


4.34. 



9 

230 


4.40. 

(i) b 

(«) b 

(iii) i 


4.41. 

(i) h 

(ii) b 

(iii) i. 

(iv)i 

4.42. 

(i) f, 

(“) h 

(iii) §, 

(iv) | 


4.43. | 



4.44. (i) £ 

(ii) i 

(iii) * 

4 - 45 - (Of (ii) ^ 

4.46. <i)£, (ii)§ 
4-47. (i)fi, (ii) ^ 
4.48. | 




4.51. (i)f, (ii) | 
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Tree diagram for Problem 4.52 Tree diagram for Problem 4.54 


(i) ifcf7 + 377). («) 


xz + x + yz 
<* + y)(* + v+l) 


4.54. (i) (ii) we must include the case where the three nondefective tubes appear first, since 

the last two tubes must then be the defective ones. 


4.55. | 

4.57. (i) jL, (ii) A, (iii)i 

4.58. (i)&, (ii) Ilf 

4.59. Only A and B are independent. 


4.60. (i) §, (ii) f, (iii) 5 

4.61. (i) 1, (ii) (iii) § 

4.63. (i) A = {HHMM, HMHM, HMMH, MHHM, MHMH, MMHH), P(A ) = .3456 

(ii) 1 - (.6)4 = .8704 

4.64. (i) S = {WW, WL, WT, LW, LL, LT, TW, TL, TT} 

(ii) .75 



Chapter 5 


Random Variables 


INTRODUCTION 

We recall the concept of a function. Let S and T be arbitrary sets. Suppose to each 
s G S there is assigned a unique element of T\ the collection / of such assignments is called 
a function (or: mapping or map) from S into T, and is written /: S T. We write f(s) for 
the element of T that / assigns to s G S, and call it the image of s under / or the value of / 
at s. The image f(A) of any subset A of S, and the preimage / -1 ( B ) of any subset B of T 
are defined by 

f(A) - {/(*) : s e A} and Z” 1 (. B ) = {s : f(s) G B} 

In words, f(A) consists of the images of points of A and / -1 (B) consists of those points whose 
images belong to B. In particular, the set f(S) of all the image points is called the image set 
(or: image or range) of /. 

Now suppose S is the sample space of some experiment. As noted previously, the 
outcomes of the experiment, i.e. the sample points of S, need not be numbers. However, 
we frequently wish to assign a specific number to each outcome, e.g. the sum of the points 
on a pair of dice, the number of aces in a bridge hand, or the time (in hours) it takes for a 
lightbulb to burn out. Such an assignment is called a random variable; more precisely, 

Definition: A random variable X on a sample space S is a function from S into the set R 

of real numbers such that the preimage of every interval of R is an event of S. 

We emphasize that if S is a discrete space in which every subset is an event, then every 
real-valued function on S is a random variable. On the other hand, it can be shown that if 
S is uncountable then certain real-valued functions on S are not random variables. 

If X and Y are random variables on the same sample space S, then X + Y, X + k, kX and 
XY (where k is a real number) are the functions on S defined by 

(X + Y)(s) = X(s) + Y(s) (kX)(s) = kX(s) 

(X + k)(s) = X(s) 4- k ( XY){s) = X(s) Y(s) 

for every s G S. It can be shown that these are also random variables. (This is trivial in 
the case that every subset of S is an event.) 

We use the short notation P(X = a) and P(a — X ^ b) for the probability of the events 
“X maps into a” and “X maps into the interval [a, 6].” That is, 

P(X = a) = P({s G S : X(s) = a}) 

and P(a^X^b) = P({s GS: a^ X(s) ^ b}) 

Analogous meanings are given to P(X ~ a), P(X = a, Y = 6), P(a ^ X ^ b, c^Y^d), etc. 
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DISTRIBUTION AND EXPECTATION OF A FINITE RANDOM VARIABLE 

Let X be a random variable on a sample space S with a finite image set; say, 
X(S) = {Xi,X 2 , . . We make X(S) into a probability space by defining the probability 

of x { to be P(X~xi) which we write f{xi). This function / on X(S), i.e. defined by 
f(xi) = P(X — Xi), is called the distribution or probability function of X and is usually given 
in the form of a table: 


*1 

x 2 


x n 

/(* l) 

f(x 2 ) 


f(r„) 


The distribution / satisfies the conditions 

(i) f(Xi) ^ 0 and (ii) 2 f( x d = 1 

i= 1 

Now if X is a random variable with the above distribution, then the mean or expectation 
(or: expected value) of X, denoted by E(X) or n x , or simply E or ^ is defined by 

E(X) - xif(xi) + x 2 f(x 2 ) + •** + Xnf(Xn) = 2 

i=l 

That is, E(X) is the weighted average of the possible values of X, each value weighted by its 
probability. 


Example 5.1: A pair of fair dice is tossed. We obtain the finite equiprobable space S consisting 

of the 36 ordered pairs of numbers between 1 and 6: 

S = {(1,1), (1,2), (6,6)} 

Let X assign to each point (a, 6) in S the maximum of its numbers, i.e. X(a, b) = 
max (a, 6). Then X is a random variable with image set 

X(S) = {1, 2, 3, 4, 5, 6} 

We compute the distribution / of X: 

/(l) = -P(* = i) = R(((l,l)}) = £ 

/(2) = P(X = 2) = P({(2,1), (2,2), (1,2)}) = ^ 

/(3) = P(V = 3) = P({(3,1), (3,2), (3,3), (2,3), (1,3)}) = ^ 

/(4) = P(X = 4) = P({(4,1), (4,2), (4, 3), (4,4), (3,4), (2,4), (1,4)}) = X 

Similarly, 

m = ^ = 5) = £ and /(6) = P{X = B) = § 

This information is put in the form of a table as follows: 


X i 

1 

2 

3 

4 

5 

6 

/(* i) 

J_ 

3 

5 

7_ 

9 

n 

36 

36 

36 

36 

36 

36 


We next compute the mean of X: 

2*i/(*j) = l’k+ 2-^ 
= 4.47 


E{X) 


+ 3 ' 


i + 4 *s5 + 6 


- + fi.ll 

OR I U oa 


161 

36 


Now let Y assign to each point (a, b) in S the sum of its numbers, i.e. F(a, b) = 
a + 6. Then Y is also a random variable on S with image set 

Y(S) = {2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12} 
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Example 5.2: 


The distribution g of Y follows: 


Vi 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

a(y ;) 

l 

2 

3 

4 

5 

6 

5 

4 

3 

2 

1 

36 

36 

36 

36 

36 

36 

36 

36 

36 

36 

36 


We obtain, for example, g{ 4) = ^ from the fact that (1,3), (2,2), and (3,1) are 
those points of S for which the sum of the components is 4; hence 

g( 4) = P(Y = 4) = P({(1,3), ( 2 , 2 ), (3, 1 )}) = jg 

The mean of Y is computed as follows: 

E(Y) = ^y i9 ( Vi ) = 2.£+3.£+ ■•• + 12.£ - 7 

The charts which follow graphically describe the above distributions: 



Distribution of X 



Observe that the vertical lines drawn above the numbers on the horizontal axis are 
proportional to their probabilities. 


A coin weighted so that P(H) = § and P(T) — 1 is tossed three times. The 
probabilities of the points in the sample space S = {HHH, HHT, HTH, HTT, 
THH, THT, TTH, TTT} are as follows: 


P(HHH) = |.|-t = £ P(THH) = i-l-f = ± 

P(HHT) = f.f.l = ± P(THT) = i-f.i = £ 

p ( HTH) = f-i-i = P( TTH) = = £ 

P(HTT) = f-J-i = £ P(TTT) = 1-i-i = jf 

Let X be the random variable which assigns to each point in S the largest 
number of successive heads which occurs. Thus, 

Z(TTT) - 0 

Z(HTH) = 1, X(HTT) = 1, Z(THT) = 1, Z(TTH) = 1 
Z(HHT) = 2, Z(THH) = 2 
Z(HHH) = 3 


The image set of X is X(S) — (0,1, 2, 3}. We compute the distribution / of X: 
/( 0) = P(TTT) = i 

/(l) = P({HTH, HTT, THT, TTH}) = ^ + | + | + | = § 

/(2) = P({HHT, THH}) = =' A 

/(3) - P(HHH) = i 
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This information is put in the form of a table as follows: 



0 

1 

2 

3 

/(*i) 

1 

27 

10 

27 

8 

27 

8 

27 


The mean of X is computed as follows: 

E(X) = 2 *,/(*,) = 0-£ + l-§ + 2-§+ 8-i = §=1.85 


Example 5.3: A sample of 3 items is selected at random from a box containing 12 items of which 
3 are defective. Find the expected number E of defective items. 

12 


The sample space S consists of the f ) = 220 distinct equally likely sam¬ 
ples of size 3. We note that there are: ' d ' 


= 84 samples with no defective items; 

= 108 samples with 1 defective item; 

* 9 = 27 samples with 2 defective items; 

3\ 

J = 1 sample with 3 defective items. 

Thus the probability of getting 0, 1, 2 and 3 defective items is respectively 84/220, 
108/220, 27/220 and 1/220. Thus the expected number E of defective items is 


E = 


0 • — 4- 1 . 
u o on “ A 


108 

220 


4 . 9 • — 4- 3 * 

' ** 99n i ° 


1 __ _ 

220 ' x ' 220 ^ ' 220 _r ° ’ 220 ~ 220 — 

Remark: Implicitly we have obtained the expectation of the random variable X 
which assigns to each sample the number of defective items in the sample. 

In a gambling game, the expected value E of the game is considered to be the value of 
the game to the player. The game is said to be favorable to the player if E is positive, and 
unfavorable if E is negative. If E = 0, the game is fair. 


165 

220 


Example 5.4: A player tosses a fair die. If a prime number occurs he wins that number of 

dollars, but if a non-prime number occurs he loses that number of dollars. The 
possible outcomes x { of the game with their respective probabilities /(a^) are as 
follows: 



2 

3 

5 

-1 

-4 

—6 

/(*<) 

i 

6 

1 

6 

i 

1 

6 

6 

i 


The negative numbers —1, —4 and —6 correspond to the fact that the player loses 
if a non-prime number occurs. The expected value of the game is 

E = 2.J + 8-* + 6.*- - 6-* = -* 

Thus the game is unfavorable to the player since the expected value is negative. 

Our first theorems relate the notion of expectation to operations on random variables. 

Theorem 5.1: Let X be a random variable and k a real number. Then (i) E(kX) = kE(X) 
and (ii) E(X + k) - E(X) + k. 

Theorem 5.2: Let X and Y be random variables on the same sample space S. Then 
E{X+Y) = E{X) + E{Y). 
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A simple induction argument yields 
Corollary 5.3: Let Ai, A 2 , .. ., A n be random variables on S. Then 

E(X i + • • • + A,) = E{Xi) + ■ • • + E(X n ) 


VARIANCE AND STANDARD DEVIATION 

The mean of a random variable X measures, in a certain sense, the “average” value of 
X. The next concept, that of the variance of X, measures the “spread” or “dispersion” of X. 

Let X be a random variable with the following distribution: 


*1 

x 2 



/(* l) 

/(ass) 


/(*„) 


Then the variance of X, denoted by Var (A*), is defined by 

Var (A) = 2 (Xi — p) 2 f(Xi) = E((X — p) 2 ) 

i=l 

where p, is the mean of X. The standard deviation of X, denoted by <r x , is the (nonnegative) 

square root of Var (A): _ 

<*x = V Var (AT) 

The next theorem gives us an alternate and sometimes more useful formula for cal¬ 
culating the variance of the random variable X. 

Theorem 5.4: Var (A) = 2 *?/(*«) - M 2 = E(X°-) - ,P. 

i= 1 

Proof. Using 2 *•/(*<) = /* and 2/(**) = 1* we have 

2 (*< - nff{ X i) - 2 ( X i ~ 2 V X i + M 2 ) f( x <) 

= 2 X i f( x 0 — 2j«. 2 Xif&i) + M 2 2 /(*■ i) 

= 2 x if( x i) - 2 ^ 2 + a * = 2 /(*«) _ m 2 

which proves the theorem. 


Example 5.5: Consider the random variable V of Example 5.1 (which assigns the maximum of 

the numbers showing on a pair of dice). The distribution of X is 



1 

2 

3 

4 

5 

6 

fW 

1 

3 

5 

_7 

9 

11 

36 

36 

36 

36 

36 

36 


and its mean is fix — 4.47. 
First we compute E(X 2 ): 

e(x 2 ) = 2 = 


Hence 

Var (X) = E(X 2 ) - & = 


We compute the variance and standard deviation of X. 


12*— 4- 92.A-L Q2 • JL _I_ 42 .X-L 52 . JL 4 . 6 2 » — 


1 = 21 -97 


21.97 - 19.98 = 1.99 and <r x = \/X99 = 1.4 


Now consider the random variable Y of Example 5.1 (which assigns the sum 
of the numbers showing on a pair of dice). The distribution of Y is 
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Vi 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

div i) 

1 

36 

2 

36 

3 

36 

4 

36 

5 

36 

6 

36 

5 

36 

4 

36 

3 

36 

2 

36 

1 

i 36 


and its mean is fi Y — 7. We compute the variance and standard deviation of Y. 
First we compute E(Y 2 ): 

E(Y*) = Svfrtv i) = 22-i+32.|+ + 12* = W = 54.8 

Hence 

Var (Y) = E(Y 2 ) - 4 = 54.8 - 49 = 5.8 and <r y = VO = 2.4 
We establish some properties of the variance in 

Theorem 5.5: Let X be a random variable and k a real number. Then (i) Var (X + k) = 
Var (X) and (ii) Var(fcX) = /c 2 Var(V). Hence <r x+k = <r x and a- kX = \k\<r x . 

Remark 1. There is a physical interpretation of mean and variance. Suppose at each 
point Xi on the x axis there is placed a unit with mass /(*<). Then the mean is 
the center of gravity of the system, and the variance is the moment of inertia 
of the system. 

Remark 2. Many random variables give rise to the same distribution; hence we frequently 
speak of the mean, variance and standard deviation of a distribution instead 
of the underlying random variable. 


Remark 3. Let X be a random variable with mean y and standard deviation a > 0. The 
standardized random variable X* corresponding to X is defined by 

/£ 

<7 

We show (Problem 5.23) that E(X*) = 0 and Var(V*) = 1. 


JOINT DISTRIBUTION 

Let X and Y be random variables on a sample space S with respective image sets 
X(S) = {xi, x 2 , ..., x„} and Y(S) = {y u y 2 , ..., y m } 

We make the product set 


i 


X(S) x Y(S) = {(*1,2/1), (*1,2/2), . . ., (*n, 2/m)} 

into a probability space by defining the probability of the ordered pair y,) to be 
P(X = Xi, Y = 2/j) which we write h(Xi, y,). This function h on X(S) x Y(S), i.e. defined by 
h(Xi, y,) = P(X — x it Y = 2 /j), is called the joint distribution or joint probability function of 
X and Y and is usually given in the form of a table: 
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The above functions f and g are defined by 

m n 

f(Xi) = 2 Vi) and d(yi) = 2 3/j) 

i=l i=l 

i.e. is the sum of the entries in the ith row and g(yj) is the sum of the entries in the jth 
column; they are called the marginal distributions and are, in fact, the (individual) distribu¬ 
tions of X and Y respectively (Problem 5.12). The joint distribution h satisfies the 
conditions n m 

(i) h(Xi, 2 /j) — 0 and (ii) X 2 Vi) = 1 

i= 1 3 — 1 

Now if X and Y are random variables with the above joint distribution (and respective 
means and /u Y ), then the covariance of X and Y, denoted by Cov ( X , Y), is defined by 

Cov (X, Y) = '2(* i -H x )(V,-l*r)H* l ,V } ) = ^ - Mx )(Y - My )] 

ij 

or equivalently (see Problem 5.18) by 

Cov ( X, Y) = 2 2/j) - MxA'-y = E(XY) - 

The correlation of X and Y, denoted by p(X, Y), is defined by 


p(X, Y) = Covi^Y) 

(T x (Ty 

The correlation p is dimensionless and has the following properties: 

(i) P (X, Y) = P ( Y, X) (iii) P (X, X) = 1, P (X, -X) = -1 

(ii) -1 ^ P ^ 1 (iv) p(aX + b,cY + d) = P (X, Y), if a, c ¥- 0 

We show below (Example 5.7) that pairs of random variables with identical (individual) 
distributions can have distinct covariances and correlations. Thus Cov ( X , Y) and p{X, Y) 
are measurements of the way that X and Y are interrelated. 


Example 5.6: A pair of fair dice is tossed. We obtain the finite equiprobable space S consisting 
of the 36 ordered pairs of numbers between 1 and 6: 

5 = {( 1 , 1 ), ( 1 , 2 ), ...,( 6 , 6 )} 

Let X and Y be the random variables on S in Example 5.1, i.e. X assigns the maxi¬ 
mum of the numbers and Y the sum of the numbers to each point of 5. The joint 
distribution of X and Y follows: 


\y 











1 

Sum 

X \ 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

1 

i 

36 

o 

0 

0 

0 

0 

0 

0 

0 

0 

0 

i 

36 

2 

0 

2 

36 

l 

36 

0 

0 

0 

0 

0 

0 

0 

0 

3 

36 

3 

0 

0 

2 

36 

2 

36 

1 

36 

0 

0 

0 

0 

0 

0 

5 

36 

4 

0 

0 

0 

2 

36 

2 

36 

2 

36 

1 

36 

0 

0 

0 

0 

7 

36 

5 

0 

0 

0 

0 

2 

2 

2 

2 

1 

0 

0 

9 


36 

36 

36 

36 

36 

36 

6 

0 

0 

0 

0 

0 

2 

36 

2 

36 

2 

36 

2 

! 36 

2 

36 

1 

36 

11 

36 

Sum 

1 

2 

3 

4 

5 

6 

5 

4 

3 

2 

1 


36 

1 36 

36 

i- 

36 

36 

36 

36 

! 36 

1 _ 

36 

36 

: 36 
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The above entry h( 3,5) = comes from the fact that (3,2) and (2,3) are the 
only points in S whose maximum number is 3 and whose sum is 5; hence 

h( 3,5) = P(X = 3, F = 5) = P({(3,2), (2,3)}) = | 

The other entries are obtained in a similar manner. 

We compute the covariance and correlation of X and Y. First we compute E(XY ): 
E(XY) = 2 x i y i h(x j, Vj) 


= 1 -2 ■ 

__ 1232 

“ 36 


+ 2 ' 3 -- + 2 ' 4 -55 + ••• + 6 - 12 -^ 
34.2 


By Example 5.1, n x - 4.47 and n Y = 7, and by Example 5.5, a x = 1.4 and 
a Y = 2.4; hence 


and 


Cov ( X , y) = E(XY) - fxxfiy = 34.2 - (4.47)(7) = 2.9 
Cov (X, Y) _ 2.9 


P(X, Y) 


CT X CTy 


(1.4)(2.4) 


= .86 


Example 5.7 : Let X and Y , and X' and Y r be random variables with the following joint dis¬ 

tributions: 


N \ y 

4 

10 

Sum 

1 

i 

i 

2 

3 

i 

i 


Sum 

i 

i 



^\y' 

4 

10 

Sum 

i 

0 

i 

i 

3 

i 

0 

i 

Sum 





Observe that X and X', and Y and Y' have identical distributions: 


Vi 

— 

4 

10 

g{y t) 





1 

3 

/(*,) 

1 

2 

i 


Distribution of X and X’ Distribution of Y and Y r 

We show that Cov (X, Y) # Cov (X', Y r ) and hence p(X, y) ¥* p(X' t Y f ). We first 
compute E(XY) and E(X f Y'): 

E(XY) ~ l*4«J + l*10*J+8-4*J + 8-10-J = 14 
E(X'Y') = 1-4*0 + 1 • 10 • J + 3 • 4 • £ + 3-10-0 = 11 
Since = Mx' = 2 and p Y ~ Py' ~ 

Cov ( X , Y) = E(XY) ~ fx xf i Y = 0 and Cov (X', Y') = E(X'Y') - = -3 

Remark: The notion of a joint distribution h is extended to any finite number of random 
variables X,Y, .. , 9 Z in the obvious way; that is, h is a function on the product 
set X(S) xY(S)x • • • x Z(S) defined by 

h(Xi, y it ..., z k ) ~ P(X = Xi , Y = y h ..., Z - z k ) 

INDEPENDENT RANDOM VARIABLES 

A finite number of random variables X, Y, ..., Z on a sample space S are said to be 
independent if 


P(X = x if Y = y h ...,Z = z k ) = P(X = Xi)P(Y = yj )---P(Z = z k ) 
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for any values x it y h ..., z k . In particular, X and Y are independent if 

P(X — Xi, Y = y s ) = P(X = Xi ) P(Y = lu) 

Now if X and Y have respective distributions / and g, and joint distribution h, then the above 
equation can be written as 

h{x u yj) - f(Xi) g(y } ) 

In other words, X and Y are independent if each entry h(x it yi) is the product of its marginal 
entries. 


Example 5.8: Let X and Y be random variables with the following joint distribution: 


y 

2 

3 

4 

Sum 






i 

.06 

.15 

.09 

.30 

2 

.14 

i 

.35 

.21 

.70 

Sum 

.20 

.50 

.30 



Thus the distributions of X and Y are as follows: 


X and Y are independent random variables since each entry of the joint distribu¬ 
tion can be obtained by multiplying its marginal entries; that is, 

P(X = x it Y = yj ) = P(X = x i )P(Y = y j ) 

for each i and each j. 

We establish some important properties of independent random variables which do not 
hold in general; namely, 

Theorem 5.6: Let X and Y be independent random variables. Then: 

(i) E(XY) = E(X)E{Y) f 

(ii) Var (X+Y) = Var (X) + Var (Y), 

(iii) Cov ( X , Y) — 0. 

Part (ii) in the above theorem generalizes to the very important 

Theorem 5.7: Let Xi, X 2 , .. ., X n be independent random variables. Then 

Var (Xi + *•* + X n ) = Var(Xi) + •■• + Var(X») 


X 

1 

2 

fix) 

.30 

.70 


Distribution of X 


y 

2 

3 

4 

s(y) 

.20 

.50 

.30 


Distribution of Y 


FUNCTIONS OF A RANDOM VARIABLE 

Let X and Y be random variables on the same sample space S. Then Y is said to be a 
function of X if Y can be represented Y = <*>(X) for some real-valued function $ of a real 
variable; that is, if Y(s) = <J>[X(s)] for every s G S. For example, kX, X 2 , X + k and (A + k) 2 
are all functions of X with <l>(x) = kx, x 2 , x + k and (x + k) 2 respectively. We have the 
fundamental 
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Theorem 5.8: 


Let X and Y be random variables on the same sample space S with 
Y = $>(X). Then 

E(Y) = £*(*i)/(* i) 

i— 1 

where / is the distribution function of X. 


Similarly, a random variable Z is said to be a function of X and Y if Z can be represented 
Z = $(X, Y) where 3> is a real-valued function of two real variables; that is, if 

Z(S) = *[X{8),Y{8)] 

for every s G S. Corresponding to the above theorem, we have 

Theorem 5.5: Let X , Y and Z be random variables on the same sample space S with 
Z = *(X,Y). Then 

E(Z) = 2 $(*«» Vi) h ( x u Vi) 

where h is the joint distribution of X and Y. 

We remark that the above two theorems have been used implicitly in the preceding dis¬ 
cussion and theorems. We also remark that the proof of Theorem 5.9 is given as a supple¬ 
mentary problem, and that the theorem generalizes to a function of n random variables in 
the obvious way. 


DISCRETE RANDOM VARIABLES IN GENERAL 

Now suppose I is a random variable on S with a countably infinite image set; say 
X(S) = {Xi, X 2 ,..Such random variables together with those with finite image sets 
(considered above) are called discrete random variables. As in the finite case, we make 
X(S) into a probability space by defining the probability of Xi to be f(x i) = P(X = Xi) and 
call / the distribution of X: 


*1 

z 2 

x 3 

... 

/(* l) 


/(* 3) 

... 


The expectation E(X) and variance Var (X) are defined by 

E(X) = xi f(x i) + x 2 f(x 2 ) + * * • - ^ Xi f(Xi) 

t=i 

Var (X) = (xi - f(xi) + (x 2 - n) 2 f(x 2 ) + ••• = 2 (®i - m) 2 ffa) 

i= 1 

when the relevant series converge absolutely. It can be shown that Var(X) exists if and 
only if fx = E(X) and E(X 2 ) both exist and that in this case the formula 

Var (X) = £(X 2 )-^ 2 

is valid just as in the finite case. When Var (X) exists, the standard deviation a x is defined 

as in the finite case by __ 

°x = Vv^F(X) 

The notions of joint distribution, independent random variables and functions of random 
variables carry over directly to the general case. It can be shown that if X and Y are 
defined on the same sample space S and if Var(X) and Var(Y) both exist, then the series 




84 


RANDOM VARIABLES 


[CHAP. 5 


Cov (. X, Y) = 2 ( x . - l i x )(y j - /j,y) y ) 

i >3 

converges absolutely and the relation 

Cov(X,Y) = Zxfthixpy,) - w y = E(XY) - ^ 

i>3 

holds just as in the finite case. 

Remark: To avoid technicalities we will establish many theorems in this chapter only for 
finite random variables. 


CONTINUOUS RANDOM VARIABLES 

Suppose that X is a random variable whose 
image set X(S) is a continuum of numbers 
such as an interval. Recall from the definition 
of random variables that the set {a — X—b} 
is an event in S and therefore the probability 
P(a — X ft) is well defined. We assume that 
there is a piecewise continuous function 
/: R-»R such that P(a — X — 6) is equal to the 
area under the graph of f between x = a and 
x = b (as shown on the right). In the lan¬ 
guage of calculus, 

P(a — X ^ b) 

In this case X is said to be a continuous random variable. The function f is called the 
distribution or the continuous probability function (or: density function) of X; it satisfies 
the conditions 

(i) f(x) 3* 0 and (ii) C f(x) dx — 1 

That is, f is nonnegative and the total area under its graph is 1. 

The expectation E(X) is defined by 

E(X) = r x f(x) dx 

c/R 

when it exists. Functions of random variables are defined just as in the discrete case; and 
it can be shown that if Y = <J>(V), then 

E(Y) = r $(x)f(x)dx 
Jr 

when the right side exists. The variance Var (X) is defined by 

Var(X) = E((X — /i) 2 ) = T (x — ft.) 2 f(x) dx 

Jr 

when it exists. Just as in the discrete case, it can be shown that Var (X) exists if and only 
if u = E(X) and E(X 2 ) both exist and then 

= r x 2 f{x)dx - jj 2 

R 



P(a — X — b) — area of shaded region 


J b f(x) 


dx 


Var (X) 


E(X 2 ) - p 2 
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The standard deviation <r x is defined by <r x = yVar(X) when Var(X) exists. 

We have already remarked that we will establish many results for finite random 
variables and take them for granted in the general discrete case and in the continuous case. 


/<*> = 


Example 5.9: Let X be a continuous random variable 
with the following distribution: 

f±x if 0 ^ x ^ 2 
^0 elsewhere 

Then 

P( 1 — X — 1.5) m area of shaded 

region in diagram 

= *•*(* + *) = -h 



We next compute the expectation, variance and standard deviation of X: 


E(X) = J 

p x f(x) dx = 

R 

C \x 2 dx rx 

r*«T _ 4 

|_6 Jo 3 

E{X 2 ) = J 

P x 2 f(x) dx — 

f r 

j" ^ x 3 dx — 

m: - ^ 

= E(X 2 ) - 

i 

II 

IM 

= I and 

- = yjl = H 


A finite number of continuous random variables, say X, Y, .. .,Z, are said to be inde¬ 
pendent if for any intervals [a,a'], [b,b'], ..., [c,c'], 

P(a^X^a', b^Y^b’, .. .,c^Z^c’) = P{a^X^a')P(b^Y^b')-■ -P(c^Z^c') 

Observe that intervals play the same role in the continuous case as points did in the 
discrete case. 


CUMULATIVE DISTRIBUTION FUNCTION 

Let A be a random variable (discrete or continuous). The cumulative distribution func¬ 
tion F of X is the function F \ R-> R defined by 


F(a) = P{X — a) 

If X is a discrete random variable with distribution /, then F is the “step function’ 
defined by „ 

f(x) = s m 

Xi — X 

On the other hand, if A is a continuous random variable with distribution f, then 

F(x) = f f(t)dt 

«/ — CO 


In either case, F is monotonic increasing, i.e. 

F(a) — F(b) whenever a—b 
and the limit of F to the left is 0 and to the right is 1: 

Lim F(x) — 0 and Lim F(x ) = 1 

X —*■ “ 00 X-+OQ 
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Example 5.10: Let X be a discrete random variable with the following distribution: 



-2 

1 

2 

4 

/(*i) 


* 

£ 

£ 


The graph of the cumulative distribution function F of X follows: 



Graph of F 


Observe that F is a “step function” with a step at the x { with height /(a^). 


Example 5.11: Let X be a continuous random vari¬ 
able with the following distribution: 


/<*) 


\x if 0 — x — 2 
0 elsewhere 



The cumulative distribution function 
F and its graph follows: 


F(x) = 


0 if x < 0 

|x 2 if 0 — x — 2 
1 if x > 2 



TCHEBYCHEFF’S INEQUALITY. LAW OF LARGE NUMBERS 

The intuitive idea of probability is the so-called “law of averages”, i.e. if an event A 
occurs with probability p then the “average number of occurrences of A” approaches p as 
the number of (independent) trials increases. This concept is made precise by the Law of 
Large Numbers stated below. The proof of this theorem uses the well-known TchebychefFs 
inequality which follows: 

Theorem 5.10 (TchebychefFs inequality): Let I be a random variable with mean p, and 
standard deviation <r. Then for every e > 0, 

P(|X - H ^ 7 

Proof . We begin with the definition of variance: 

a* = Var(X) = 2 
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We delete all the terms in the above series for which | Xi — p\ < e. This does not increase the 
value of the series, since all its terms are nonnegative; that is, 


cr 2 ^ 2 * (Xi-tiffiXi) 
i 


where the asterisk indicates that the summation extends only over those i for which 
\Xi — ja| — €. Thus this new summation does not increase in value if we replace each |Xi — fi\ 


by «; that is, 


^ ^ « 2 f(%i) = c 2 S*/^) 

i i 


But 2* /(*t) is equal to the probability that \X — p.\ —«; hence 

" 2 ^ e 2 P( 


Dividing by t 2 we get the desired inequality. 


Theorem 5.11 (Law of Large Numbers): Let X lt X 2 , ... be a sequence of independent 
random variables with the same distribution with mean p and variance a 2 . 

Let — 

S n = (Xi + X 2 + • • • + X n )/n 

(called the sample mean). Then for any «> 0 

lim P{\S n - ji| — «) = 0 or equivalently lim P(\S n - < e) = 1 


Proof. Note first that 

„,o x _ E(X 0 + E(X 2 ) + • • • + E(X n ) _n»_ 

E(S n ) - - — - #* 

Since X u ..., X„ are independent, it follows from Theorem 5.7 that 

Var (Xi + • • • + X n ) = Var(Xi) + • • • + Var(X„) = no 2 

Therefore by Theorem 5.5(ii), 

Var (S n ) = Var ^ 1 -^ — ) = ^ Var (Xi + ■ • • +X n ) = = £ 

Thus by TchebychefTs inequality, 

P(\Sn-r-\^<) ~ —2 

The theorem now follows from the fact that the limit of the right side is 0 as n 00 . 

The following remarks are in order. 

Remark 1. We proved TchebychefTs inequality only for the discrete case. The con¬ 
tinuous case follows from an analogous proof which uses integrals instead 
of summations. 

Remark 2. We proved the Law of Large Numbers only in the case that the variance of the 
Xi exists, i.e. does not diverge. We note that the theorem is true whenever E(Xi) 
exists. 


Remark 3. The above Law of Large Numbers is also called the Weak Law of Large 
Numbers because of a similar, but stronger, theorem called the Strong Law 
of Large Numbers. 
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Solved Problems 

RANDOM VARIABLES AND EXPECTATION 



(i) fi = 2 »{/(*() = 2-1 + 3-1 + 11-1 = 4 
2 *?/(Zi) = 22-1 + 32-1 + 112-| = 26 

<r 2 = 2*-/(ic i ) - /i 2 = 26-16 = 10 

<r = \/l0 = 3.2 

(ii) ii = 2 *i /(*{) = “5*i - 4 • ^ + 1*£ + 2 • ^ = -1 
2®f/(*») = 25 • ^ + 16 4 + 1 • i + 4 • £ = 9.25 

a 2 => 2 f{*d ~ n 2 = 9.25 - 1 = 8.25 

a = V^25 = 2.9 

(iii) M = 2*i/(*i) = 1(.4) + 3(.l) + 4(.2) + 5(.3) = 3 
2 *•/(*{) = 1(.4) + 9(.l) + 16(.2) + 25(.3) = 12 

<r a - 2 x f f(%i) — m 2 = 12-9 - 3 

a = V3 = 1.7 

5.2. A fair die is tossed. Let X denote twice the number appearing, and let Y denote 1 or 3 
according as an odd or an even number appears. Find the distribution, expectation, 
variance and standard deviation of (i) X, (ii) Y, (iii) X + Y, (iv) XT. 

The sample space is S = {1,2, 3,4,5, 6}, and each number appears with probability 

(i) X(l) =* 2, X(2) = 4, X(S) = 6, X(4) = 8, X(5) = 10, X(6) = 12. Thus X(S) = {2, 4, 6,8, 10,12} 
and each number has probability Thus the distribution of X is as follows: 


x i 

2 

4 

6 

8 

10 

12 


i 

£ 

£ 


£ 

£ 


Accordingly, 

MX = E ( X ) = 2 x i f( x i) 

= 2-i + 4-i + 6-£ + B’i + 10-i + 12-i = f = 7 

2?(A2) = 2*f/(*i) 

= 4 *£ + 16 + 36 + 64 + 100*1 + 144*1 = 221 - 60-7 

= Var (A) = E{X 2 ) - M » = 

<r x = VTL7 = 3.4 


60.7 - (7)2 


11.7 
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(n) V(l) 1, F(2) - 3, F(3) - 1 , F(4) = 3, F(5) = 1 , Y(6) = 3. Hence Y(S) = {1,3} and 

ff(l) - P(Y = 1) = P({1,3,5}) =| = | an d ^(3) = P(F = 3) = P({2,4,6}) = | 
Thus the distribution of Y is as follows: 


Accordingly, 

Hr - E{Y) 


Vj 

1 

3 

g(Vj ) 

~T 



2 V) g(v } ) = 1 • £ + 3 • £ = 2 


= 2 y) g(y,) = 1 • £ + 9 • £ = 5 
4 = Var(F) = E(F2) - ^ = 5 - (2)2 = 1 

®v = VI = 1 


1 

2 


(iii) Using (X + F)(s) = X(s) + F(s), we obtain 

{X + F)(l) = 2 + 1 = 3 (X + F)(3) = 6+1 = 7 

(X+Y)(2) = 4 + 3 = 7 (Z + F)( 4 ) = 8 + 3 = 11 

Hence the image set is (X+ F)(S) = {3,7,11,15} and 3 and 
7 and 11 with probability §. That is, the distribution of X + 



3 

7 

11 

15 

p(«i) 

1 

6 

2 

6 

2 

6 

1 

6 


E(X + Y) = 3-|+ 7.|+ ll.|+ 15 .| = f = 9 
E((X+Y) 2) = 9 • j + 49 • | + 121 • | + 225 • 5 = = 95.7 

Var (X + F) = E({X+Y )*) - „* = 95.7 - 92 = 14.7 

a x+y = Vl4.7 = 3.8 

11.7+ b l S6 = + F) F) = 7 + 2 = 9 = E(X + F) ’ ^ Var <*> + Var < F ) = 


(AT + F)(5) = 10 + 1 = 11 

(X + F)(6) = 12 + 3 = 15 

15 occur with probability |, and 
F is as follows: 


(iv) Using (XY)(s) = Z(s) F(s), we obtain 

(^F)(l) = 2*1 = 2 (ZF)(3) = 6*1 = 6 (ZF)(5) = 10 • 1 = 10 

(XF)( 2) = 4*3 =12 (ZF)(4) = 8*3 = 24 (ZF)(6) = 12 • 3 = 36 

Hence the distribution of XY is as follows: 


U>i 

2 

6 

10 

12 

24 

36 

P(Wi) 

1 

6 

1 

¥ 

1 

i 

* 

i 


E(XY) = 2*^ + 6*^ + 10 *1+12 *^ + 24*1+ 36*^ = f = 15 

E((XY) 2 ) = 4*£ + 36*1 + 100 + 144*1 + 576 • £ + 1296 • i 

= = 359.3 

Var (ZF) = E((XY) 2) - p = 359.3 - I52 = 134.3 

a XY = V134.3 = 11.6 
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5.3. A coin weighted so that P(H) = £ and P(T) = £ is tossed three times. Let X be the 
random variable which denotes the longest string of heads which occurs. Find the 
distribution, expectation, variance and standard deviation of X. 

The random variable X is defined on the sample space 

S = (HHH, HHT, HTH, HTT, THH, THT, TTH, TTT} 

The points in S have the following respective probabilities: 

P(HHH) = £•£•£ = |A P(THH) 

P(HHT) = £•£•£ - ^ P(THT) 

P(HTH) = £•£•£ = & P(TTH) 

P(HTT) = £•£•£ = ^ P(TTT) 

Since X denotes the longest string of heads, 

AT(TTT) = 0; X(HTT) = 1, Z(HTH) = 1, X(THT) = 1, X(TTH) = 1; 

X(HHT) = 2, X(THH) = 2; X(HHH) = 3 


- 1 . a . 3 = _9_ 

4 4 4 6 4 

- i-f-i = A 
= i*i-i = 

= i ’ i * i = (Tf 


Thus the image set of X is X(S) = {0,1,2, 3). The probability f(x ( ) of each number x ( in X(S) is 
obtained by summing the probabilities of the points in S whose image is x { : 

f( 0) = P(TTT) = 

/(1) = P(HTT) + P(HTH) + P(THT) + P(TTH) = if 
/(2) = P(HHT) + P(THH) = If 
/(3) = P(HHH) = 1 1 

Accordingly, the distribution of X is as follows: 



0 

1 

2 

3 

/(*i> 

1 

64 

1 8 

64 

1 8 

64 

§i 


Thus 

M = E(X) = 0-£ + 1-H + 2-if + 3-|i = f = 2.1 
W) = 0*A + + 4 'H + 9*fi = W = 5.2 

a 2 = Var(X) = E(X 2 ) - M 2 = 5.2 - (2.1)2 = .g 

o- = v^8 = .9 


5.4. A fair coin is tossed until a head or five tails occurs. Find the expected number E 
of tosses of the coin. 

Only one toss occurs if heads occurs the first time, i.e. the event H. Two tosses occur if the first 
is tails and the second is heads, i.e. the event TH. Three tosses occur if the first two are tails and 
the third is heads, i.e. the event TTH. Four tosses occur if TTTH occurs, and five tosses occur if 
either TTTTH or TTTTT occurs. Hence 

/(l) = P(H) = i 

m = p(th) = i 

/(B) = P(TTH) = £ 

/(4) = P(TTTH) = jL 

/(5) = P(TTTTH) + P(TTTTT) = ± = A. 

Accordingly, E = l*^ + 2 >f + 3 > J + 4 *J 5 + 5 *JL = 2A = 1 . 9 . 
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5.5. Concentric circles of radius 1 and 3 inches are drawn 
on a circular target of radius 5 inches. A man re¬ 
ceives 10, 5 or 3 points according if he hits the target 
inside the smaller circle, inside the middle annular 
region or inside the outer annular region respectively. 
Suppose the man hits the target with probability | 
and then is just as likely to hit one point of the target 
as the other. Find the expected number E of points 
he scores each time he fires. 

The probability of scoring 10, 5, 3 or 0 points follows: 

fnm — I. area of 10 points _ 1 tt(1) 2 



Thus E = 10* 


m 

/( 3 ) 
/(0) 


■ + 5*^+3 


2 

area of target 

2 w( 5) 2 50 


1 b 

area of 5 points 

_ 1 . 3) 2 — 7r(l) 2 _ 

8 

2 

area of target 

2 jt(5) 2 

50 

1 

area of 3 points 

_ 1 . ,r(5) 2 - ,r(3) 2 

16 

2 ' 

area of target 

2 tt(5) 2 

50 


1 

2 

16 

50 


+ o 


, 1 __ 

’ 2 


= § = 1.96. 


5.6. A player tosses two fair coins. He wins $1 or $2 according as 1 or 2 heads appear. 
On the other hand, he loses $5 if no heads appear. Determine the expected value E 
of the game and if it is favorable to the player. 

The probability that 2 heads appear is J, that 2 tails (no heads) appear is \ and that 1 head 
appears is ^ Thus the probability of winning $2 is 1, of winning $1 is 1, and of losing $5 is 
Hence E = 2-J + l- i — 5*J = —J = —0.25. That is, the expected value of the game is minus 
25^, and so is unfavorable to the player. 


5.7. A player tosses two fair coins. He wins $5 if 2 heads occur, $2 if 1 head occurs and 
$1 if no heads occur, (i) Find his expected winnings, (ii) How much should he pay 
to play the game if it is to be fair? 

(i) The probability of winning $5 is of winning $2 is -J, and of winning $1 is hence 
E = 5*J + 2*-J + l*i = 2.50, that is, the expected winnings are $2.50. 

(ii) If he pays $2.50 to play the game, then the game is fair. 


JOINT DISTRIBUTIONS, INDEPENDENT RANDOM VARIABLES 

5.8. Suppose X and Y have the following joint distribution: 


\ y 

-3 

2 

4 

Sum 

i 

.1 

.2 

.2 

.5 

3 

.3 

.1 

.1 

.5 

Sum 

.4 

.3 

.3 



(i) Find the distributions of X and Y . 

(ii) Find Cov (X, Y), i.e. the covariance of X and Y . 

(iii) Find P (X, Y), i.e. the correlation of X and Y. 

(iv) Are X and Y independent random variables? 
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(i) The marginal distribution on the right is the distribution of X, and the marginal distribution 
on the bottom is the distribution of Y. Namely, 



1 

3 

/(*i) 

.5 

1 .5 

_ 


Vi 

-3 

2 

4 

g(Vj) 

.4 

.3 

.3 


Distribution of X Distribution of Y 


(ii) First compute fi x and fiy' 

Mx = 2 = (1)(.5) + (3)(.5) = 2 

My = 2 V) 9 (vj) = (—3)(-4) + (2)(.3) + (4)(.3) = .6 

Next compute E(XY): 

E(XY) — 2 *iJ/j h{x j, y } ) 

= <1)(—3)(.l) + (1)(2)(.2) + (1)(4)(.2) + (3)(—3)(.3) + (3)(2)(.l) + (3)(4)(.l) 
Then Cov (X, Y) = E(XY) - y x y Y = 0 - (2)(.6) = -1.2 


(iii) First compute <r x and ° r Y : 

E(XZ) = i) = (1)(.5) + (9)(.5) = 5 

4 = Var(X) = E(X?) - = 5 - (2)2 = 1 

a x = Vl = 1 

and 

E(Y*) = 2 y 2 j 9 (Vj) = (9)(.4) + (4)(.3) + (16)(.3) = 9.6 
aj = Var(F) = E(Y*) - My = 9.6 - (.6) 2 = 9.24 

a y — V 9.24 — 3.0 


Then 


P(X, Y) 


Cov (X, Y) _ -1.2 

a x a Y (1)(3.0) 


-.4 


0 


(iv) X and Y are not independent, since P(X = 1, Y = —3) 7 ^ P(X = 1) P(F = —3), i.e. the entry 
h( 1, —3) = ,1 is not equal to /(l) ^(—3) = (.5)(.4) = .2, the product of its marginal entries. 


5.9. Let X and Y be independent random variables with the following distributions: 


Vi 

5 

10 

15 

a{y j) 

.2 

.5 

.3 



1 

2 

f(Xi) 

.6 

.4 


Distribution of X Distribution of Y 

Find the joint distribution h of X and Y. 

Since X and F are independent, the joint distribution h can be obtained from the marginal 
distributions / and g. First construct the joint distribution table with only the marginal distributions 
as shown below on the left, and then multiply the marginal entries to obtain the other entries, i.e. 
set h(x { , ?/j) = g(yj), as shown below on the right. 


Y 

x\. 

5 

10 

15 

Sum 

1 




.6 

2 




.4 

Sum 

.2 

.5 

.3 



Y 

X 

5 

10 

15 

Sum 

1 

.12 

.30 

.18 

.6 

2 

.08 

.20 

.12 

.4 

Sum 

.2 

.5 

.3 
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5.10. A fair coin is tossed three times. Let X denote 0 or 1 according as a head or a tail 
occurs on the first toss, and let Y denote the number of heads which occur. Determine 
(i) the distributions of X and Y, (ii) the joint distribution h of X and Y, (iii) Cov (. X, Y). 

(i) The sample space S consists of the following eight points, each with probability 
S “ {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT) 

We have V(HHH) = 0, X(HHT) = 0, V(HTH) = 0, X(HTT) = 0 

X(THH) = 1, Z(THT) = 1, X(TTH) = 1, Z(TTT) = 1 

and 7(HHH) = 3 

Y(HHT) = 2, Y(HTH) = 2, Y(THH) = 2 
Y(HTT) = 1, Y(THT) = 1, Y(TTH) = 1 
Y(TTT) = 0 

Thus the distributions of X and Y are as follows: 


x i 

0 

1 

/(*! i ) 

1 

2 

i 


Vi 

0 

1 

2 

00 

g(y ,) 

* 

C©j00 

3 

8 

1 

8 


Distribution of X Distribution of Y 


(ii) The joint distribution h of X and Y follows: 


\ r 

0 

1 

2 

3 

Sum 

0 

0 

8 

t 

x 

8 

1 

2 

1 

l 

8 

1 

i 

0 

1 

2 

Sum 

1 

8 

& 

8 

& 

8 

4 



We obtain, for example, the entry h{ 0, 2) = P(X = 0, Y = 2) = P({HTH, HHT}) - £. 

(iii) fi x = 2®i/(»i) = ^ 

vy = 2 Vj g(yj) - o«£ + i- § + 2- £ + s*£ = f 

E(XY) = 2 XiVj H x v 2/j) = l*l*£ + l # 2-£ + terms with a factor 0 = -| 

Cov(X,Y) = E{XY) - hxmy = *"*•! = "i 


5.11. Let A be a random variable with the following distribution and let Y — X 2 : 



-2 

-1 

1 

2 

/(* i) 

i 

4 

4 

X 

4 

i 

4 


Determine (i) the distribution g of Y, (ii) the joint distribution h of X and Y, 
(iii) Cov (X, Y) and P (X, Y). 
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(i) Since Y = X 2 , the random variable Y can only take on the values 4 and 1. Furthermore, 

g( 4) = P(Y = 4) = P(X = 2orX = ~2) = P(A = 2) + P(X = -2) = £ + £ = £ 
and, similarly, #(1) = T. Hence the distribution g of Y is as follows: 


Vj 

1 

4 

g{y }) 


i 


(ii) The joint distribution h of X and Y appears below. Note that if X = —2, then Y = 4; hence 
/t( — 2,1) = 0 and &(—2, 4) = /(—2) — The other entries are obtained in a similar way. 


\ F 

X \ 

1 

4 

Sum 

-2 

0 

l 

4 

X 

4 

-1 

l 

4 

0 

_L 

4 

1 

1 

4 

0 

X 

4 

2 

0 

1 

t 

X 

4 

Sum 

1 

2 

1 

2 



(hi) Vx 

fly 

E{XY) 
Cov (A, Y) 


= 2 *(/(*<) = -24-14 + 14 + 2.1 = 0 

w = 2 V)9(v,) = 14 + 44 = 4 

2 *i!/jVj) = -8-4 - 1*4 4 - 1-4 + 8*4 = 0 
E(XY) — nxfty — 0 — 0*4 = ® and so p(^> Y) = 0 


Remark: This example shows that although Y is a function of X it is still possible for the 
covariance and correlation of X and Y to be 0, as in the case when X and Y are inde¬ 
pendent (Theorem 5.6). Notice, however, that X and Y are not independent in this 
example. 


PROOFS OF THEOREMS 

Remark: In all the proofs, X and Y are random variables with distributions / and g 
respectively and joint distribution h . 


5.12. Show that f(Xi) = ^ h(Xi,yj) and g(yj) = ^ h(Xi,yi), i.e. that the marginal distribu- 

i i 

tions are the (individual) distributions of X and Y. 

Let A i = {X = 't i } and B j = {Y = y j ); that is, let A t = X~ 1 (x i ) and Bj = Y" x (y f ). Thus the 
Bj are disjoint and S=u } Bj. Hence 

Ai = S = AiMUjBj) = U jiAiOBj) 

where the A^Bj are also disjoint. Accordingly, 

/(*i) = P(X = xO = P(Ai) = 2 PAiCiBj) = 2 P(X = x i ,Y = Vj ) = 2 h(Xi, yj ) 

3 3 3 

The proof for g is similar. 


5.13. Prove Theorem 5.8: Let X and Y be random variables on the same sample space S 
with Y — $(X). Then E(Y) — 2 ®(Xi)f(Xi) where / is the distribution of X. 

i 

(Proof is given for the case X is discrete and finite.) 
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Suppose that X takes on the values x lf .. .,x n and that <*>( 0 ^) takes on the values y lf .. < s y m as 
i runs from 1 to n. Then clearly the possible values of F - <t>(X) are y l9 .. ,,y m and the distribu¬ 
tion g of F is given by 


ffiVj) = 


2 

{i : 


/(*i) 


Therefore 


E(Y) 


2 Vjffiyj) = 2 yj 2 /(&i) 

i = 1 i=l 0(x { ) = yi } 


which proves the theorem. 


2 /(**) 2 Vj 

i=l (j : ^Xi>= yj > 


.2 /fo) *(a?i) 


5.14. Prove Theorem 5.1: Let X be a random variable and k a real number. Then 
(i) E(kX) = kE(X) and (ii) + = E(X) 4- k. 

(Proof is given for the general discrete case with the assumption that E(X) exists.) 

(i) Now kX = $(X) where <f>(x) = kx. Therefore by Theorem 5.8 (Problem 5.13), 

E(kX) = 2 kxiffri) = k 2*i/(*i) = kE(X) 

(ii) Here X + k = 4»(Z) where 3>(a?) = x + k. Therefore 

E(X + k) = 2 («t + fc)/(a?i) = 2*i/(*i) + 2fc/(«i) = £?(*) + k 

i i i 


5.15. Prove Theorem 5.2: Let X and Y be random variables on the same sample space S 
Then E{X + Y) — E(X) + E(Y). 

(Proof is given for the general discrete case with the assumption that E(X) and E(Y) both 
exist.) 

Now X + Y = 4>(X, Y) where <t(x, y) = x + y. Therefore by Theorem 5.9, 

E(X + Y) = 2 5(ij + Vj) h(x u j fj) = 22*j h(x i( y) +22 Vj h(x ( , y } ) 

13 i j i j J 

Applying Problem 5.12, we get 

E(X+Y) = 2 + 2 V s g(Vj) = E(X) + E(Y) 

i j 


5.16. Prove Corollary 5.3: Let X u Z 2 , ..., X n be random variables on S. Then 

E(X i + • ■ • + X n ) = E(X t ) + • - • + E(X n ) 

(Proof is given for the general discrete case with the assumption that E(X t ), .. ,,E(X n ) all 
exist.) 

We prove this by induction on n. The case n — 1 is trivial and the case n — 2 is just Theorem 
5.2 (Problem 5.15). For the case n > 2 we apply the case n = 2 to obtain 

E(X t + ••• +X n _! + X n ) = E(X t + ••• +X n _ 1 ) + E(X n ) 

and by the inductive hypothesis this becomes E(X t ) + • • • + E(X n _ j) + E(X n ). 


5.17. Prove Theorem 5.5: (i) Var (X + k) = Var (X) and (ii) Var (kX) = k 2 Var (X). Hence 

°X + k ~ °x an ^ a kX = 1^1 

By Theorem 5.1, Mx + k = /*x + * an d Pkx = hPx- Also 2 **/(*<) = J“x and 2 /(*») = 1. Hence 




96 


RANDOM VARIABLES 


[CHAP. 5 


Var (X + k) = 2 (* 4 + fc) 2 /<*t) “ Mx + fc 

= 2 *? /(*i) + 2fc 2 /(*«) + fc 2 2 /(*i) - (i“x + fc) 2 

= 2 /(*<) + 2fc/«x + k 2 — + 2fc/i X + fc 2 ) 

= ~ 4 = Var (A) 

and Var(fcX) = 2 (fc*<) 2 /(*i) “ Mkx = fc 2 2 *?/(*■>- (fcMx) 2 

= fc 2 2 ^/(^ - fcV 2 = fc 2 (2 x 2 /(* i ) - 4) = fc 2 Var (A) 


5.18. Show that 

Cov(X, Y) = '2 l (x t - l Xx){y j -ii Y )h(x i ,y) = 2 M*i>^j) ~ W 

i,j i>j 

(Proof is given for the case when X and F are discrete and finite.) 

Since 

2 2/j fc(*i, jo) = 2l/jfl , (2/j) = My. 2*i h^vVj) = 2«i/(»i) = Mx and 2 fc(*i> Vj) = 1 

i, j j i,3‘ * *,3 

we obtain 

2 (*i - Mx)(2/j ~ My) i/j) 

i,3 

= 2 - fixVj - VY%i + V'XM'y) H%i, 2/j) 

i*3 

= 2 a>i2/j fc(*i, 2/j) - Mx 2 2/j fc(*i, 2/j) ~ My2*i M*i> 2/j) + MxMy 2 M*i. 2/j) 

i,j i,j i,3 i,3 

= 2 #i2/j 2/j) ~ Mx^Y — MxMy + fiX^Y 

i,i 

= 2 x iVj H*u 2/j) ~ MxMy 

1,3 


5.19. Prove Theorem 5.6: Let X and Y be independent random variables. Then 
(i) tf(XY) = E(X)E(Y), (ii) Var (X + Y) = Var (X) + Var (Y), (iii) Cov(X,Y) = 0. 

(Proof is given for the case when X and Y are discrete and finite.) 

Since X and Y are independent, h(x it %/j) = /(x 4 ) g{Vj). Thus 

E(XY) = '2,x i y j h(x i ,y i ) = 2 *iJ/j/(*<) g(y t ) 

iJ *'3 

= 2*i/(*<)2 Vig(vj> = E(X)E(Y) 

i i 

and Cov (X, Y) — E(XY) — p x p Y = J£OT#(F) — MxMy — 0 

In order to prove (ii) we also need 

p*x+Y = /*x + 2 Vj) — 2 2 y j 2/j) 2 Vj ffivj ) 

i, j i J J 

Hence 

= 2 (*i + 2/j) 2 fc(*i,2/j) — MX + Y 

t, j 

= + 2 2 Vi) + 2 2/j 2/j) — (mx + ^y) 2 

ij j 1 i,i iJ 

= 2 «i/(«i) + 22 iCi/(a?j) 2 2/j 27(2/j) + 2 2/j i/(2/j) " 4” 2 /^xMy ” M 

t i 5 3 

= 2 *?/(**) - m 2 x + 2t/>(j/j) - M 2 = Var (X) + Var (F) 


Var(A+V) 


Kj ^ 
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5.20. Prove Theorem 5.7: Let Xi,X 2 ,.. ,,X n be independent random variables. Then 

Var(Xi + • • • +X n ) = Var (Xi) + • • • + Var (X„) 

(Proof is given for the case when X lf ..., X n are all discrete and finite.) 

We take for granted the analogs of Problem 5.12 and Theorem 5.9 for n random variables. Then 


Var (X x -f • ■ • + X n ) = E((X 1 +---+X n -p Xx +...+ x% ) *) 


= 2 (*!+•••+*„ - MX t + ... +x n ) 2 *(*l. •••>*«) 

= 2 (*x + • • • + X n - /i Xl - Mx n ) 2 h(x lt ...,x n ) 

= 2^2 2 +22 fiXzfiX: ~ 22 2 fix.*}?- h ( x i> •••»*„) 

1 J 13 X 3 I 


where ft is the joint distribution of X lt .. .,I n , and /i Xi+ ... +x ^ = p Xl + • • • + / 2 X ^ (Corollary 5.3). 
Since the are pairwise independent, 2 x { Xjh(x u . .., x n ) = p x .p x . f° r ^ ^ 7* Hence 

l J 


Var (X x + ••• +X n ) 


2 fix.fix, 
i#j 1 J 


+ 


Seix;, 


+ 


2 2 


-222 


y ^ x / x j 


= .2 E(X\) - 2 (MX.) 2 = 2 Var (AO 

. , * = 1 <=1 1 i=l 

as required. 


MISCELLANEOUS PROBLEMS 


5.21. 


Let X be a continuous random variable with distribution 


/(*) 


jix + k if 0 — # — 3 
[0 elsewhere 


(i) Evaluate k . (ii) Find P(1 — X^2). 

(i) The graph of / is drawn below. Since / is a continuous probability function, the shaded 
region A must have area 1. Note A forms a trapezoid with parallel bases of lengths ft and ft + 
and altitude 3. Hence the area of A — -J(ft H- ft 4- ^) *3 = 1 or k — JL. 




(ii) P(1 — X —2) is equal to the area of B which is under the graph of f and between x = 1 and 
x — 2 as shown above on the right. Note /(1) — ^ — yz> /(2) = i T 2 = A* Hence 

P(1^X^2) = areaof B = £(& + &)•! = f 


5.22. 


Let A be a continuous random variable whose distribution / is constant on an interval, 
say I — {a — x~ b], and 0 elsewhere: 


m 


k if a — x — b 
0 elsewhere 


(Such a random variable is said to be uniformly distributed on 7.) (i) Determine k . 

(ii) Find the mean n of X. (iii) Determine the cumulative distribution function F of X. 





98 


RANDOM VARIABLES 


[CHAP. 5 


(i) 


(ii) 


The graph of f appears on the right. The region 
A must have area 1; hence 

k(b - a) = 1 - - 1 


or 


k = 


b — a 


If we view probability as weight or mass, and 
the m^an as the center of gravity, then it is 
intuitively clear that 

a + b 

P = 


/ = 0 


i 

K 

J!_Li 


f = k 


b 


, ! 


Graph of / 


f = 0 


the point midway between a and 6. We verify this mathematically using calculus: 


(i = E(X) = ^ x f(x) dx = f YZ~^ dx 

R *a 

b 2 a 2 _ a + b 

" 2 (b - a) ” 2(6 - a) ~ 2 


[_ 2(6 — a) 


(iii) Recall that the cumulative distribution function 
F is defined by F(k) = P(X — k). Hence F(k) 
gives the area under the graph of / to the left 
of x = k> Since X is uniformly distributed on 
the interval I — (a — x — 6}, it is intuitive that 
the graph of F should be as shown on the right, 
i.e. F = 0 before the point a, F = 1 after the 
point b, and F is linear between a and 6, We 
verify this mathematically using calculus: 


F = 1 


F^O 



1 


Graph of F 


(а) for x < a, 

Fix) = 

(б) for a- — x — b, 



(c) for x > b, F(x) = P(X ^ ») ^ P(X ^ b) = F(6) = 1 and also 1 p(X ^ a?) = F(x); 
hence F(x) = 1. 



5.23. Let X be a random variable with mean /x and standard deviation <r > 0; and let X* be 
the standardized random variable corresponding to X , i.e. X* = (X — /x)/a. Show 
that E(X*) = 0 and Var(X*) = l. (Hence <r x * = 1.) 

By Theorem 5.1 and Theorem 5.5, 

E(X*) = = i E(X-p) = -(E(A) — /«) = 0 

\ ° ° ° 

and Var (X*) = Var = ^ Var (.X-p) = ^Var (V) = J = 1 

5.24. Let X be a random variable with distribution /. The rth moment M r of X is defined by 

Mr = E(X*) = 2 < /(«i) 

Find the first five moments of X if X has the following distribution: 


Xi 

-2 

1 

CO 

/(*«) 

2 

i 

4 


(Note that Mi is the mean of X, and M 2 is used in computing the variance and standard 
deviation of X.) 
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My = 2*i/(*j) = 

m 2 = 2*f/(«i) = 

^3 = 2 /(*i) - 

M 4 = 2 ** /(a?i) = 

M s = 2 *f /(*,) = 


-2*i + l'i + 3-J = 0, 

4 • £ + 1 • £ + 9 • £ = 4.5, 

-8 • £ + 1 • £ + 27 • £ = 3, 

16 • £ + 1 • £ + 81 • £ = 28.5, 
—32 • £ + 1 • £ + 243 • £ = 45. 


5.25. Let h be the joint distribution of the random variables X and Y. (i) Show that the 
distribution / of the sum Z = X + Y can be obtained by summing the probabilities 
along the diagonal lines x + y = z k , i.e. 

f{Zk) = 2 Vi) = 2 h(Xi, Zk ~ Xi) 

Z k = x i + Vj Xi 

(ii) Apply (i) to obtain the distribution f of the sum Z = X + Y where X and Y have 
the following joint distribution: 


\ y 

-2 

-1 

0 

1 

2 

3 

Sum 

0 

.05 

.05 

.10 

0 

.05 

.05 

.30 

i 

.10 

.05 

.05 

.10 

0 

.05 

.35 

2 

.03 

.12 

.07 

.06 

.03 

.04 

.35 

Sum 

.18 

.22 

.22 

.16 

.08 

.14 



(i) The events {X = x t ,Y = y, : + y t = z k ] are disjoint; hence 

f(z k ) = P(Z — z k ) = 2 P(X = x u Y = y } ) 

Xi + yj = z k 

= 2 h(%i, Vj) = 2 h(x lt z k - Xi ) 

x i + V) = z k x i 


(ii) 


Y 

X 

—2 

-1 

0 

1 

2 

3 

0 

.05 

.05 

.10 

0 

.05 

.05 

1 

.10 

.05 

.05 

.10 

0 

.05 

2 

.03 

.12 

.07 

.06 

.03 

.04 


Adding along the diagonal lines in the above table, we obtain 


/(-2) = .05 

/(-l) = .05 + .10 = .15 
/(0) = .10 + .05 + .03 = .18 

/(1) - 0 + .05 + .12 = .17 


/(2) = .05 + .10 + .07 = .22 
/(3) = .05 + 0 + .06 = .11 
/(4) = .05 + .03 = .08 
/(5) = .04 


In other words, the distribution of Z = X + Y is as follows: 



-2 

-1 

0 

1 

2 

3 

4 

5 

/(*«) 

.05 

.15 

.18 

.17 

.22 

.11 

.08 

.04 
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Supplementary Problems 

RANDOM VARIABLES 

5.26. Find the mean /i, variance a 2 and standard deviation u of each distribution: 



5.27. A pair of fair dice is thrown. Let X be the random variable which denotes the minimum of the 
two numbers which appear. Find the distribution, mean, variance and standard deviation of X . 

5.28. A fair coin is tossed four times. Let X denote the number of heads occurring. Find the distribution, 
mean, variance and standard deviation of X. 

5.29. A fair coin is tossed four times. Let Y denote the longest string of heads occurring. Find the 
distribution, mean, variance and standard deviation of Y . 

5.30. Find the mean / l , variance a 2 and standard deviation a of the two-point distribution 



a 

b 

/<* i) 

V 

<? 


where p 4- q — 1. 

5.31. Two cards are selected at random from a box which contains five cards numbered 1, 1, 2, 2 and 3. 
Let X denote the sum and Y the maximum of the two numbers drawn. Find the distribution, mean, 
variance and standard deviation of (i) X, (ii) Y, (iii) X + Y, (iv) XY. 

EXPECTATION 

5.32. A fair coin is tossed until a head or four tails occur. Find the expected number of tosses of the coin. 

5.33. A coin weighted so that P(H) = 1 and P(T) = $ is tossed until a head or five tails occur. Find 
the expected number of tosses of the coin. 

5.34. A box contains 8 items of which 2 are defective. A man selects 3 items from the box. Find the 
expected number of defective items he has drawn. 

5.35. A box contains 10 transistors of which 2 are defective. A transistor is selected from the box and 
tested until a nondefective one is chosen. Find the expected number of transistors to be chosen. 

5.36. Solve the preceding problem in the case that 3 of the 10 items are defective. 

5.37. The probability of team A winning any game is -J. A plays team B in a tournament. The first team 
to win 2 games in a row or a total of three games wins the tournament. Find the expected number 
of games in the tournament. 

5.38. A player tosses three fair coins. He wins $5 if 3 heads occur, $3 if 2 heads occur, and $1 if only 
1 head occurs. On the other hand, he loses $15 if 3 tails occur. Find the value of the game to the 
player. 

5.39. A player tosses three fair coins. He wins $8 if 3 heads occur, $3 if 2 heads occur, and $1 if only 1 
head occurs. If the game is to be fair, how much should he lose if no heads occur? 
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5.40. A player tosses three fair coins. He wins $10 if 3 heads occur, $5 if 2 heads occur, $3 if 1 head 
occurs and $2 if no heads occur. If the game is to be fair, how much should he pay to play the game? 

JOINT DISTRIBUTION, INDEPENDENT RANDOM VARIABLES 

5.41. Consider the following joint distribution of A and Y : 


Y 

AX. 

-4 

2 

7 

Sum 

1 

i 

i 

i 

i | 

5 

4 


i 


Sum 

t 

t 

i 



Find (i) E(X) and E(Y), (ii) Cov ( X , Y), (iii) <r x , <r Y and p(Xj Y)* 
5.42. Consider the following joint distribution of X and Y: 


\ y 

A \ 

-2 

-1 

4 

5 

Sum 

1 

.1 

.2 

0 

.3 

.6 

2 

.2 

.1 

.1 

0 

.4 

Sum 

.3 

.3 

.1 

.3 



Find (i) E(X) and E(Y), (ii) Cov (X, Y ), (iii) <r x , <r Y and p(X, Y). 

5.43. Suppose X and Y are independent random variables with the following respective distributions: 


Vi 

-2 

5 

8 

s(Vj) 

.3 

.5 

.2 



1 

2 

/(* t) 

.7 

.3 


Find the joint distribution of X and Y, and verify that Cov (X, Y) = 0. 

5.44. A fair coin is tossed four times. Let X denote the number of heads occurring and let Y denote the 
longest string of heads occurring (see Problems 5.28 and 5.29). (i) Determine the joint distribution 
of X and Y. (ii) Find Cov (. X , Y) and p(X, Y). 

5.45. Two cards are selected at random from a box which contains five cards numbered 1, 1, 2, 2 and 3. 
Let X denote the sum and Y the maximum of the two numbers drawn (see Problem 5.31). (i) Deter¬ 
mine the joint distribution of X and Y. (ii) Find Cov (X, F) and p{X, y). 


MISCELLANEOUS PROBLEMS 

5.46. Let A be a continuous random variable with distribution 


(i) 

(ii) 


m = 


£ if 0 ^ a ^ 8 
0 elsewhere 


Find: P(2 ^ X ^ 5), P( 3 ^ A ^ 7) and P(X ^ 6). 

Determine and plot the graph of the cumulative distribution function F of X. 


5.47. 


Let A be a continuous random variable with distribution 


/(*) 


f kx if 0 — x — 5 
[0 elsewhere 


(i) Evaluate k. (ii) Find P(1 — A — 3), P(2 — A — 4) and P{ X ^ 3). 
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5.48. Plot the graph of the cumulative distribution function F of the random variable X with distribution 



-3 

2 

6 

f(Xi) 

i 


i 


5.49. Show that v x — 0 if and only if X is a constant function, i.e. X(s) = k for every s G S, or simply 
X — k, 

5.50. If a x # 0, show that p(X,X) = 1 and p(X, -X) = -1. 

5.51. Prove Theorem 5.9: Let X, Y and Z be random variables on S with Z = 4>(X, Y). Then 

E(Z) = 2 •Hxj, Vj) h(Xi, yj) 
t*5 

where h is the joint distribution of X and Y. 


Answers to Supplementary Problems 

5.26. (i) /. = 4, <r2 3= 5.5, o = 2.3; (ii) p. = 0, = 10, v = 3.2; (iii) p = 1, = 2.4, <r = 1.5. 


5.27. 



1 

2 

3 

4 

5 

6 

/(*«> 

36 

9 

36 

7 

36 

5 

36 

3 

36 

l 

36 


E(X) = 2.5, Var (X) = 2.1, <r x = 1.4 


5.28. 


Xi 

0 

1 

2 

3 

4 

m 

l 

1 6 

4 

1 6 

6 

16 

4 

1 6 

l 

1 6 


E(X) = 2, Var (X) = 1, <r x = 1 


5.29. 


2/j 

0 

1 

2 

3 

4 

ffO/j) 

l 

16 

7 

1 6 

5 

1 6 

2 

1 6 

1 

1 6 


E(Y) = 1.7, Var (F) = 0.9, o Y = 0.95 


5.30. /u = ap + bq, a 2 = pq(a—b) 2 , a = \a—b\^/pq 


5.31. (i) 


(ii) 



2 

3 

4 

5 

/(**) 

.1 

.4 

.3 

.2 


E{X) = 3.6, Yar (. X ) = .84, a x = .9 


2/j 

1 

2 

3 

ffiVj) 

.! , 

.5 

.4 


E(Y) = 2.3, Var (Y) = .41, a y = .64 
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Graph of F 














Chapter 6 


Binomial, Normal and 
Poisson Distributions 


BINOMIAL DISTRIBUTION 

We consider repeated and independent trials of an experiment with two outcomes; we 
call one of the outcomes success and the other outcome failure . Let p be the probability of 
success, so that q — 1 — p is the probability of failure. If we are interested in the number 
of successes and not in the order in which they occur, then the following theorem applies. 


Theorem 6.1: 


The probability of 
given by 


exactly k successes in n repeated trials is denoted and 
b(k;n,p) = Q)p k q n ~ k 


Here (J) is the binomial coefficient (see page 19). Observe that the probability of no suc¬ 
cesses is q n , and therefore the probability of at least one success is 1 - 


Example 6.1: A fair coin is tossed 6 times or, equivalently, six fair coins are tossed; call heads 
a success. Then n = 6 and p = q = ■£. 

(i) The probability that exactly two heads occur (i.e. k = 2) is 

6(2; 6,^) = ( 6 2 )4)*(i) 4 - if 

(ii) The probability of getting at least four heads (i.e. k = 4, 5 or 6) is 

6(4; 6,^) + 6(5; 6,*) + 6(6; 6,*) = (•)<*)«(*)* + ( 6 5 ) <*)« (*) + (®) (*)• 

= = 11 
64 ' 64 ' 64 32 

(iii) The probability of no heads (i.e. all failures) is q Q = (i) 6 = JL, and so the 
probability of at least one head is 1 — q 6 ”l—= M. 

^ 64 64 


Example 6.2: A fair die is tossed 7 times; call a toss a success if a 5 or a 6 appears. Then n — 7, 
V = -P({5, 6}) = f and q — 1 - p = £. 

(i) The probability that a 5 or a 6 occurs exactly 3 times (i.e. k = 3) is 

*<8;7.*) = ( 3 7 )(i) 3 (f) 4 - 

(ii) The probability that a 5 or a 6 never occurs (i.e. all failures) is q 7 = (-|) 7 — t~|-; 
hence the probability that a 5 or a 6 occurs at least once is 1 - q 7 = |ff|. 
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If we regard n and p as constant, then the above function P(k) = b(k;n,p) is a dis¬ 
crete probability distribution: 



It is called the binomial distribution since for k = 0,1, 2, ..., n it corresponds to the suc¬ 
cessive terms of the binomial expansion 

0 q + p) n = q n + (i )q n ~ 1 p + Q q n ~ 2 p 2 + ••• + p n 

This distribution is also called the Bernoulli distribution, and independent trials with two 
outcomes are called Bernoulli trials. 


Properties of this distribution follow: 


Theorem 6.2: 



Example 6 k 3: A fair die is tossed 180 times. The expected number of sixes is = np == 

180 • d. = 30. The standard deviation is a — ifnpq = Vl80 • ^ • f — 5. 


NORMAL DISTRIBUTION 


The normal (or: Gaussian) distribution or curve is defined as follows: 


f{x) 



where /x and <j > 0 are arbitrary constants. This function is certainly one of the most 
important examples of a continuous probability distribution. The two diagrams below 
show the changes in / as ^ varies and as a varies. In particular, observe that these bell¬ 
shaped curves are symmetric about x = //,. 
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Properties of the normal distribution follow: 

Theorem 6.3: 


Normal distribution 

Mean 


Variance 

<7* 

Standard deviation 

o 


We denote the above normal distribution with mean [n and variance a 2 by 

N(n, a 2 ) 

If we make the substitution t= (x — /n)/o- in the above formula for N (/<,, a 2 ) we obtain the 
standard normal distribution or curve 

m = 

y%Tr 

which has mean ^ = 0 and variance a* = 1. The graph of this distribution appears below. 
We note that for — 1 ^ t ^ 1 we obtain 68.2% of the area under the curve, and for 
_ 2 ^ t ^ 2 we obtain 95.4% of the area under the curve. 



A table on page 111 gives the area under the standard normal curve between t — 0 and 
any positive value of t. The symmetry of the curve about t = 0 permits us to obtain the area 
between any two values of t (see Problem 6.14). 

Now let A be a continuous random variable with a normal distribution; we frequently 
say that X is normally distributed . We compute the probability that X lies between a and b, 
denoted by P(a — X — b), as follows. First we change a and b into standard units 

a ' = (a — ju)/<r and 6' = (b — ^/fr 

respectively. Then 

P(a^X^b) = P(a'^X*^b') 

= area under the standard normal curve between a' and 6' 

Here X* is the standardized random variable (see page 79) corresponding to X, and hence 
X* has the standard normal distribution N( 0, 1). 
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NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION. 

CENTRAL LIMIT THEOREM 

The binomial distribution P(k) = b(k;n,p) is closely approximated by the normal 
distribution providing n is large and neither p nor q is close to zero. This property is 
indicated in the following diagram where we have chosen the binomial distribution corre¬ 
sponding to n = 8 and p — q = •£. 


k 

0 

1 

2 

3 

4 

5 

6 

7 

8 

p(k) 

i 

8 

28 

56 

70 

56 

28 

8 

l 

256 

256 

_1 

256 

256 

256 

256 

256 

256 

256 


Binomial distribution with n — 8 and p — q — ^ 



Comparison of the binomial and normal distributions 


The above property of the normal distribution is generalized in the Central Limit 
Theorem which follows. The proof of this theorem lies beyond the scope of this text. 


Central Limit Theorem 6.4: 


Let Xu X 2f ... be a sequence of independent random variables 
with the same distribution with mean p and variance <r 2 . 
Let 

y _ Xt + X 2 + * * ’ + X U — Tip 

71 ~ V ^ 

Then for any interval {a^x^b}, 

lim P(a — Z n — b) = P(a — <f> — b) 

n-+ oo 

where </> is the standard normal distribution. 


Recall that we called S n = (-3li + X 2 + * ■ • + X n )/n the sample mean of the random vari¬ 
ables Xu ..., X n . Thus Z n in the above theorem is the standardized sample mean. Roughly 
speaking, the central limit theorem says that in any sequence of repeated trials the standard¬ 
ized sample mean approaches the standard normal curve as the number of trials increase. 


POISSON DISTRIBUTION 

The Poisson distribution is defined as follows: 

P(k; A) = , k = 0,1, 2, ... 

where A > 0 is some constant. This countably infinite distribution appears in many natural 
phenomena, such as the number of telephone calls per minute at some switchboard, the 
number of misprints per page in a large text, and the number of a particles emitted by a 
radioactive substance. Diagrams of the Poisson distribution for various values of A follow. 
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Poisson distribution for selected values of X 


„.lll 

III,. 

0 2 4 6 8 

10 12 14 16 


x = 10 


Properties of the Poisson distribution follow: 


Theorem 6.5: 


Poisson distribution 

Mean 

fi — X 

Variance 

to 

II 

Standard deviation 

a — VX 


Although the Poisson distribution is of independent interest, it also provides us with a 
close approximation of the binomial distribution for small k provided that p is small and 
A = np (see Problem 6.27). This is indicated in the following table. 


k 

0 

1 

2 

3 

4 

5 

Binomial 

.366 

.370 

.185 

.0610 

.0149 

.0029 

Poisson 

.368 

.368 

.184 

.0613 

.0153 

.00307 


Comparison of Binomial and Poisson distributions 
with n = 100, p = 1/100 and A = np = 1 


MULTINOMIAL DISTRIBUTION 

The binomial distribution is generalized as follows. Suppose the sample space of an 
experiment is partitioned into, say, s mutually exclusive events At, A 2 , .. A s with respective 
probabilities p u P 2 , . .., p s . (Hence pi + P 2 + • • • + p* = 1.) Then: 


Theorem 6.6: 


In n repeated trials, the probability that A x occurs fci times, A 2 occurs fc 2 
times, ..., and A s occurs k s times is equal to 


nl 

ki ! fc 2 ! • • • k s ! 


P k l p k 2 • • • V k s 
*1 “ 2 S 


where ki 4- k 2 + • * • + k s = n. 


The above numbers form the so-called multinomial distribution since they are precisely 
the terms in the expansion of (pi + p 2 + • * • + p s ) n . Observe that if s = 2 then we obtain 
the binomial distribution, discussed at the beginning of the chapter. 


Example 6.4: 


A fair die is tossed 8 times. The probability of obtaining the faces 5 and 6 twice 
and each of the others once is 


8 ! 


2! 2! 1! 1! 1! 1! 




35 

5832 


.006 
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STANDARD NORMAL CURVE ORDINATES 

This table gives values cf>(t) of the / 

standard normal distribution <j> at t — 0 / 

in steps of 0.01. / 

( 

N 

r 

0«) 

Ll 

> i 

t. 


t 0 1 2 3 4 5 6 7 8 9 


]Aj|| 

.3989 

.3989 

.3989 

.3988 

.3986 

.3984 

.3982 

.3980 

.3977 

.3973 


.3970 

.3965 

.3961 

.3956 

.3951 

.3945 

.3939 

.3932 

.3925 

.3918 


.3910 

.3902 

.3894 

.3885 

.3876 

.3867 

.3857 

.3847 

.3836 

.3825 


.3814 

.3802 

.3790 

.3778 

.3765 

.3752 

.3739 

.3725 

.3712 

.3697 

0.4 

.3683 

.3668 

.3653 

.3637 

.3621 

.3605 

.3589 

.3572 

.3555 

.3538 

0.5 

.3521 

.3503 

.3485 

.3467 

.3448 

.3429 

.3410 

.3391 

.3372 

.3352 

0.6 

.3332 

.3312 

.3292 

.3271 

.3251 

.3230 

.3209 

.3187 

.3166 

.3144 

0.7 

.3123 

.3101 

.3079 

.3056 

.3034 

.3011 

.2989 

.2966 

.2943 

.2920 

0.8 

.2897 

.2874 

.2850 

.2827 

.2803 

.2780 

.2756 

.2732 

.2709 

.2685 

0.9 

.2661 

.2637 

.2613 

.2589 

.2565 

.2541 

.2516 

.2492 

.2468 

.2444 

1.0 

.2420 

.2396 

.2371 

.2347 

.2323 

.2299 

.2275 

.2251 

.2227 

.2203 

IKK 

.2179 

.2155 

.2131 

.2107 

.2083 

.2059 

.2036 

.2012 

.1989 

.1965 


.1942 

.1919 

.1895 

.1872 

.1849 

.1826 

.1804 

.1781 

.1758 

.1736 

I 

.1714 

.1691 

.1669 

.1647 

.1626 

.1604 

.1582 

.1561 

.1539 

.1518 


.1497 

.1476 

.1456 

.1435 

.1415 

.1394 

.1374 

.1354 

.1334 

.1315 


.1295 

.1276 

.1257 

.1238 

.1219 

.1200 

.1182 

.1163 

.1145 

.1127 

1.6 

.1109 

.1092 

.1074 

.1057 

.1040 

.1023 

.1006 

.0989 

.0973 

.0957 

1.7 

.0940 

.0925 

.0909 

.0893 

.0878 

.0863 

.0848 

.0833 

.0818 

.0804 

1.8 

.0790 

.0775 

.0761 

.0748 

.0734 

.0721 

.0707 

.0694 

.0681 

.0669 

1.9 

.0656 

.0644 

.0632 

.0620 

.0608 

.0596 

.0584 

.0573 

.0562 

.0551 

2.0 

.0540 

.0529 

.0519 

.0508 

.0498 

.0488 

.0478 

.0468 

.0459 

.0449 

2.1 

.0440 

.0431 

.0422 

.0413 

.0404 

.0396 

.0387 

.0379 

.0371 

.0363 

2.2 

.0355 

.0347 

.0339 

.0332 

.0325 

.0317 

.0310 

.0303 

.0297 

.0290 

2.3 

.0283 

.0277 

.0270 

.0264 

.0258 

.0252 

.0246 

.0241 

.0235 

.0229 

2.4 

.0224 

.0219 

.0213 

.0208 

.0203 

.0198 

.0194 

.0189 

.0184 

.0180 

2.5 

.0175 

.0171 

.0167 

.0163 

.0158 

.0154 

.0151 

.0147 

.0143 

.0139 

2.6 

.0136 

.0132 

.0129 

.0126 

.0122 

.0119 

.0116 

.0113 

.0110 

.0107 

2.7 

.0104 

.0101 

.0099 

.0096 

.0093 

.0091 

.0088 

.0086 

.0084 

.0081 

2.8 

.0079 

.0077 

.0075 

.0073 

.0071 

.0069 

.0067 

.0065 

.0063 

.0061 

2.9 

.0060 

.0058 

.0056 

.0055 

.0053 

.0051 

.0050 

.0048 

.0047 

.0046 

3.0 

.0044 

.0043 

.0042 

.0040 

.0039 

.0038 

.0037 

.0036 

.0035 

.0034 

3.1 

.0033 

.0032 

.0031 

.0030 

.0029 

.0028 

.0027 

.0026 

.0025 

.0025 

3.2 

.0024 

.0023 

.0022 

.0022 

.0021 

.0020 

.0020 

.0019 

.0018 

.0018 

3.3 

.0017 

.0017 

.0016 

.0016 

.0015 

.0015 

.0014 

.0014 

.0013 

.0013 

3.4 

.0012 

.0012 

.0012 

.0011 

.0011 

.0010 

.0010 

.0010 

.0009 

.0009 

3.5 

.0009 

.0008 

.0008 

.0008 

.0008 

.0007 

.0007 

.0007 

.0007 

.0006 

3.6 

.0006 

.0006 

.0006 

.0005 

.0005 

.0005 

.0005 

.0005 

.0005 

.0004 

3.7 

.0004 

.0004 

.0004 

.0004 

.0004 

.0004 

.0003 

.0003 

.0003 

.0003 

3.8 

.0003 

.0003 

.0003 

.0003 

.0003 

.0002 

.0002 

.0002 

.0002 

.0002 

3.9 

.0002 

.0002 

.0002 

.0002 

.0002 

.0002 

.0002 

.0002 

.0001 

.0001 


Table 6.1 
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STANDARD NORMAL CURVE AREAS 


This table gives areas under the stand¬ 
ard normal distribution <j> between 0 and 
t — 0 in steps of 0.01. 


D 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

lfil» 

.0000 

.0040 

.0080 

.0120 

.0160 

.0199 

.0239 

.0279 

.0319 

.0359 

... 1 

.0398 

.0438 

.0478 

.0517 

.0557 

.0596 

.0636 

.0675 

.0714 

.0754 

«lf« 

.0793 

.0832 

.0871 

.0910 

.0948 

.0987 

.1026 

.1064 

.1103 

.1141 


.1179 

.1217 

.1255 

.1293 

.1331 

.1368 

.1406 

.1443 

.1480 

.1517 

0.4 

.1554 

.1591 

.1628 

.1664 

.1700 

.1736 

.1772 

.1808 

.1844 

.1879 

0.5 

.1915 

.1950 

.1985 

.2019 

.2054 

.2088 

.2123 

.2157 

.2190 

.2224 

0.6 

.2258 

.2291 

.2324 

.2357 

.2389 

.2422 

.2454 

.2486 

.2518 

.2549 

0.7 

.2580 

.2612 

.2642 

.2673 

.2704 

.2734 

.2764 

.2794 

.2823 

.2852 

0.8 

.2881 

.2910 

.2939 

.2967 

.2996 

.3023 

.3051 

.3078 

.3106 

.3133 

0.9 

.3159 

.3186 

.3212 

.3238 

.3264 

.3289 

.3315 

.3340 

.3365 

.3389 

1.0 

.3413 

.3438 

.3461 

.3485 

.3508 

.3531 

.3554 

.3577 

.3599 

.3621 

mm 

.3643 

.3665 

.3686 

.3708 

.3729 

.3749 

.3770 

.3790 

.3810 

.3830 

E£fl 

.3849 

.3869 

.3888 

.3907 

.3925 

.3944 

.3962 

.3980 

.3997 

.4015 

III 

.4032 

.4049 

.4066 

.4082 

.4099 

.4115 

.4131 

.4147 

.4162 

.4177 

I 

.4192 

.4207 

.4222 

.4236 

.4251 

.4265 

.4279 

.4292 

.4306 

.4319 

m 

.4332 

.4345 

.4357 

.4370 

.4382 

.4394 

.4406 

.4418 

.4429 

.4441 

19 

.4452 

.4463 

.4474 

.4484 

.4495 

.4505 

.4515 

.4525 

.4535 

.4545 

B9 

.4554 

.4564 

.4573 

.4582 

.4591 

.4599 

.4608 

.4616 

.4625 

.4633 

1.8 

.4641 

.4649 

.4656 

.4664 

.4671 

.4678 

.4686 

.4693 

.4699 

.4706 

1.9 

.4713 

.4719 

.4726 

.4732 

.4738 

.4744 

.4750 

.4756 

.4761 

.4767 

2.0 

.4772 

.4778 

.4783 

.4788 

.4793 

.4798 

.4803 

.4808 

.4812 

.4817 

2.1 

.4821 

.4826 

.4830 

.4834 

.4838 

.4842 

.4846 

.4850 

.4854 

.4857 

2.2 

.4861 

.4864 

.4868 

.4871 

.4875 

.4878 

.4881 

.4884 

.4887 

.4890 

2.3 

.4893 

.4896 

.4898 

.4901 

.4904 

.4906 

.4909 

.4911 

.4913 

.4916 

2.4 

.4918 

.4920 

.4922 

.4925 

.4927 

.4929 

.4931 

.4932 

.4934 

.4936 

2.5 

.4938 

.4940 

.4941 

.4943 

.4945 

.4946 

.4948 

.4949 

.4951 

.4952 

2.6 

.4953 

.4955 

.4956 

.4957 

.4959 

.4960 

.4961 

.4962 

.4963 

.4964 

2.7 

.4965 

.4966 

.4967 

.4968 

.4969 

.4970 

.4971 

.4972 

.4973 

.4974 

2.8 

.4974 

.4975 

.4976 

.4977 

.4977 

.4978 

.4979 

.4979 

.4980 

.4981 

2.9 

.4981 

.4982 

.4982 

.4983 

.4984 

I 

.4984 

.4985 

.4985 

.4986 

.4986 

3.0 

.4987 

.4987 

.4987 

.4988 

.4988 

.4989 

.4989 

.4989 

.4990 

.4990 

3.1 

.4990 

.4991 

.4991 

.4991 

.4992 

.4992 

.4992 

.4992 

.4993 

.4993 

3.2 

.4993 

.4993 

.4994 

.4994 

.4994 

.4994 

.4994 

.4995 

.4995 

.4995 

3.3 

.4995 

.4995 

.4995 

.4996 

.4996 

.4996 

.4996 

.4996 

.4996 

.4997 

3.4 

.4997 

.4997 

.4997 

.4997 

.4997 

.4997 

.4997 

.4997 

.4997 

.4998 

3.5 

.4998 

.4998 

.4998 

.4998 

.4998 

.4998 

.4998 

.4998 

.4998 

.4998 

3.6 

.4998 

.4998 

.4999 

.4999 

.4999 

.4999 

.4999 

.4999 

.4999 

.4999 

3.7 

.4999 

.4999 

.4999 

.4999 

.4999 

.4999 

.4999 

.4999 

.4999 

.4999 

3.8 

.4999 

.4999 

.4999 

.4999 

.4999 

.4999 

.4999 

.4999 

.4999 

.4999 

3.9 

.5000 

.5000 

.5000 

.5000 

.5000 

.5000 

.5000 

.5000 

.5000 

.5000 



Table 6.2 
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VALUES OF e~ x 

X 

0.0 

0.1 

0.2 

0.3 

0.4 

0.5 

0.6 

0.7 

0.8 

0.9 

e~ x 

1.000 

.905 

.819 

.741 

.670 

.607 

.549 

.497 

.449 

.407 

n 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

1 e“ x 

.368 

.135 

.0498 

.0183 

.00674 

.00248 

.000912 

.000335 

.000123 

.000045 


Table 6.3 


Solved Problems 

BINOMIAL DISTRIBUTION 

6 . 1 . Find (i)&(2;5,£), (ii) 6(3; 6, *), (iii) 6(3; 4, i). 

Here b(k',n,p) = (] l)p k q n ~ k where p + q = 1. 

(i) K2; 5,*) = (=)(iP(|) 3 = (i) 2 (f) 3 = H. 

(ii) 6(8; 6,p = (•)(*)• (p* = f$f(p*(p* - 

(iii) 6(8; 4, J) = 0(i)3(|) = (t)(i) 3 (|) = \ (|) 3 (f) = *. 

6.2. A fair coin is tossed three times. Find the probability P that there will appear 
(i) three heads, (ii) two heads, (iii) one head, (iv) no heads. 

Method 1. We obtain the following equiprobable space of eight elements: 

S = {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT} 

(i) Three heads (HHH) occurs only once among the eight sample points; hence P — 

(ii) Two heads occurs 3 times (HHT, HTH, and THH); hence P = %. 

(iii) One head occurs 3 times (HTT, THT and TTH); hence P — 

(iv) No heads, i.e. three tails (TTT), occurs only once; hence P = 

Method 2. Use Theorem 6.1 with n — 3 and p = q = 

(i) Here k = 3 and P = 6(3; 3,*) = (|) (l) 3 (\)° = 1 • £ • 1 = £. 

(ii) Here k = 2 and P = 6(2; 3, i) = (*) (|) 2 (£) = = f. 

(iii) Here k = 1 and P = 6(1; 3, A) = (J) (l) 1 (^) 2 = 3 • 1 • J = f. 

(iv) Here /c = 0 and P — 6(0; 3,1) = (jj) (^)° (l) 3 = 1 • 1 • £ = 

6.3. Team A has probability f of winning whenever it plays. If A plays 4 games, find 
the probability that A wins (i) exactly 2 games, (ii) at least 1 game, (iii) more than 
half of the games. 

Here n = 4, p = | and q — 1 — p = 

(i) P(2 wins) = 6(2; 4, f) = ( 2 ) (f) 2 (£) 2 = |> 
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( 11 ) Here q 4 - (£) 4 - ^ is the probability that A loses all four games. Then 1 - q* = |® j s t h e 
probability of winning at least one game. 1 

(iii) A wins more than half the games if A wins 3 or 4 games. Hence the required probability is 
P(3 wins) + P(4 wins) = (J) (§)3 (£) + (*) (|)4 = |f + |f = 


6.4. A family has 6 children. Find the probability P that there are (i) 3 boys and 3 girls, 

(n) fewer boys than girls. Assume that the probability of any particular child being 
a boy is 

Here n = 6 and p — q = ^. 

(i) P = P(3 boys) = (|) (|)3 (£)3 = = JL. 

(ii) There are fewer boys than girls if there are 0, 1 or 2 boys. Hence 

P = P(0 boys) + P (1 boy) + P(2 boys) = (*)• + (?) (i )(£) 5 + («) (£) 4 = li 


6.5. How many dice must be thrown so that there is a better than even chance of obtaining 
a six? 

, • ProbabiHty of not obtaining a six on n dice is (f)". Hence we seek the smallest n for 
which (f)” is less than 

<t>‘ = i■ <S> ! = »; (*)> = ffli ( t ). = g < i 

Thus 4 dice must be thrown. 


6 . 6 . 


The probability of a man hitting a target is $. (i) If he fires 7 times, what is the 

probability P of his hittting the target at least twice ? (ii) How many times must he 
fire so that the probability of his hitting the target at least once is greater than f ? 


(i) 


We seek the sum of the probabilities for k = 2 , 3, 4, 5 , 6 and 7 . It is simpler in this case to 
find the sum of the probabilities for * = 0 and 1 , i.e. no hits or 1 hit, and then subtract it from 1 . 

P(no hits) = (f)7 = P(i hit) = (J)(i )(|)6 = ^ 


Then P 


2187 _ 5103 _ 4547 

16,384 16,384 — 8192* 


(ii) The probability of not hitting the target is 9 ". Thus we seek the smallest n for which 9 " is 
less than 1 - | = i, where 9=l-p = l~i = f. Hence compute successive powers of 
q until q n < % is obtained: 

(f ) 1 = f « i; (f ) 2 = *<i; = but (f ) 4 = §- 6 < % 

In other words, he must fire 4 times. 


6.7. Prove Theorem 6.1: The probability of exactly k successes in n repeated trials is 
b(k;n,p) = Qp k q n ~ k . 

The sample space of the n repeated trials consists of all ordered n-tuples whose components 
are either s (success) or / (failure). The event A of k successes consists of all ordered n-tuples of 
which k components are s and the other n — k components are /. The number of n-tuples in the 
event A is equal to the number of ways that k letters s can be distributed among the n components 

of an n-tuple, hence A consists of (^) sample points. Since the probability of each point in A is 
pk qn-k, we have P(A) = (”) p* qn~K 
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6.8. Prove Theorem 6.2: Let I be a random variable with the binomial distribution 
b{k-,n,v). Then (i) E(X) = np and (ii) Var (X) = npq. Hence <j x = Ywpq. 


(i) Using b(k;n,p) = C)P k <l n ~ k > we obtain 


E(X) = 2 k-b{k-,n,p) = 2 k 

k =0 


2, k ~ r rT ~~—TTT P k <I n ~ k 

k =o k\(n — k)\ ^ 


np 2 


(n — 1 )! 


k =t (k-l)\(n-k)l 

(we drop the term k = 0 since its value is zero, and we factor out np from each term). We let 
s = Jc — 1 in the above sum. As k runs through the values 1 to n, s runs through the values 
0 to n ^ 1. Thus 


n -1 


E(X) = 


(ti-1)! 


=0 s 1 {n — 1 — s)! 


ps qn 


n—1 

= np 2 b(s;n— 1 , p) = np 

s = Q 


since, by the binomial theorem, 

n— 1 

2 6 (s;?i —l,p) = (p + ^) n “ x — l ”" -1 = 1 

s —0 


(ii) We first compute S(X 2 ): 


E(X>) = 2 kn(k-,n,p) = 2 -—?b- — pk q n-k 


jc—o A;! (n —fc)! 

(n - 1 )! 


wp 2 k 


(k-l)\(n-k)\ 


Again we let s = k — 1 and obtain 


E(XZ) = np 2 o (s + D s ; ( ( ”_ P s = wp2 o (s + l) 6 (s;n-l 


,f>) 


n-1 n-1 7l _ 1 

But 2 (s + 1) b(s; n— 1, p) = 2 s*6(s;w — 1, p) + 2 6 (s;n — l,p) 

s —0 s=° s_0 

= (w - l)p + 1 = wp + 1 - p = np + q 

where we use (i) to obtain (n — l)p. Accordingly, 

E(X 2 ) = 7 ip(up + g) = (tip ) 2 + n pq 

and Var (X) = E(X°-) - p* = (np ) 2 + npq - (np ) 2 = npq 

Thus the theorem is proved. 


6 9 Determine the expected number of boys in a family with 8 children, assuming the 
sex distribution to be equally probable. What is the probability that the expected 
number of boys does occur? 

The expected number of boys is E = np = 8 • £ = 4. The probability that the family has four 

boysls 6(4; 8,^) = 0(i) 4 (i) 4 = fro(i) 8 = m = ' 27 


6.10. The probability is 0.02 that an item produced by a factory is defective. A shipment 
of 10,000 items is sent to its warehouse. Find the expected number E of defective 
items and the standard deviation or. 

E = np ~ (10,000)(0.02) = 200. 
a = VnPf = V(10,000)(0.02)(0.98) = \/l96 = 14. 
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NORMAL DISTRIBUTION 


6 . 11 . 


The mean and standard deviation on an examination are 74 and 12 respectively. 
Find the scores in standard units of students receiving grades (i) 65, (ii) 74, (iii) 86, 
(iv) 92. 


(i) 



(ii) 



65 ~ 74 
12 

74 - 74 
12 


= -0.75 


= 0 


(iii) t 



(iv) 



86 - 74 
12 

92 - 74 
12 


1.0 


1.5 


6.12. Referring to the preceding problem, find the grades corresponding to standard scores 
(i) -1, (ii) 0.5, (iii) 1.25, (iv) 1.75. 

(i) » = at + H = (12)(—1) + 74 = 62 (iii) x = at + ft = (12)(1.25) + 74 = 89 

(ii) a? = ut + n = (12)<0.5) + 74 = 80 (iv) x = at + ft = (12)(1.75) + 74 = 95 

6.13. Let </> be the standard normal distribution. Find </>(£) at (i) t = 1.63, (ii) £ = —0.75, 
(iii) £ = —2.08. 

(i) In Table 6.1, page 110, look down the first column until the entry 1.6 is reached. Then con¬ 
tinue right to column 3. The entry is .1057. Hence 0(1.63) = .1057. 

(ii) By symmetry, 0 (—0.75) = 0(0.75) = .3011. 

(iii) 0(—2.08) = 0(2.08) = .0459. 


6.14. Let X be a random variable with the standard normal distribution </>. Find: 

(i) P(0 — X — 1.42) (v) P(—1.79 — X — —0.54) 

(ii) P(—0.73 — X — 0) (vi) P(X — 1.13) 

(iii) P(—1.37 ^ X ^ 2.01) (vii) P( \X\ ^ 0.5) 

(iv) P(0.65 — X — 1.26) 

(i) P(0~A — 1.42) is equal to the area under the 
standard normal curve between 0 and 1.42. Thus 
in Table 6 . 2 , page 111 , look down the first column 
until 1.4 is reached, and then continue right 
to column 2. The entry is .4222. Hence 
P(0 ^ X 1.42) = .4222. 

0 1.42 

(ii) By symmetry, 

P(—0.73 ^ X ^ 0) 

= P (0 — X — 0.73) = .2673 

-.73 0 




(iii) P(—1.37 — X — 2.01) 

= P(—1.37 — A ^ 0) + P(0 - 2.01) 

= .4147 + .4778 = .8925 
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(iv) P(0.65 — X — 1.26) 

= P(0 — X— 1.26) — P(0 — X — 0.66) 
= .3962 - .2422 = .1540 


(v) 


P(—1.79 — X — -0.64) 

= P(0.54 ^ X =£ 1.79) 

= P(0 — X— 1.79) - P(0 ^ X =£ 0.64) 
= .4633 - .2054 = .2579 


(vi) P(X— 1.13) 

= P(X - 0) — P(0 — X — 1.13) 
= .5000 - .3708 = .1292 


(vii) P(|X|^0.5) 

= P(—0.5 X ^ 0.5) 
= 2P(0 — X— 0.5) 

= 2(.1915) = .3830 






6.15. Let X be a random variable with the standard normal distribution <j>. Determine the 
value of t if (i) P(0^X^t) = .4236, (ii) P(X^t) = .7967, (iii) P(t^X^ 2) = .1000. 


(i) In Table 6.2, page 111, the entry .4236 appears 
to the right of row 1.4 and under column 3. 
Hence t = 1.43. 



(ii) Note first that t must be positive since the 
probability is greater than We have 

P(0 ^ X ^ t) = P(X ~ t) - ^ 

= .7967 - .5000 - .2967 

Thus from Table 6.2 we obtain t — .83. 



(iii) P(0 ^ X * t) = P(0 ^ X ^ 2) - P{t ^ X ^ 2) 

- .4772 - .1000 = .3772 

Thus from Table 6.2 we obtain t — 1.161 (by 
linear interpolation) or simply t = 1.16. 
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6.16. Suppose the temperature T during June is normally distributed with mean 68° and 
standard deviation 6°. Find the probability p that the temperature is between 70° 
and 80°. 


70° in standard units — (70 — 68)/6 = .33. 

80° in standard units = (80 — 68)/6 = 2.00. 
Then 

p = P(70 — T 7 — 80) = P(.33^T*^2) 

= P(0 ^ — 2) — P(0 ^ T* ^ .33) 

= .4772 - .1293 = .3479 

Here T* is the standardized random variable 
corresponding to T, and so T* has the standard normal 
distribution <f >. 



6.17. Suppose the heights H of 800 students are normally distributed with mean 66 inches 
and standard deviation 5 inches. Find the number N of students with heights 
(i) between 65 and 70 inches, (ii) greater than or equal to 6 feet (72 inches). 


(i) 65 inches in standard units = (65 — 66)/5 = —.20. 
70 inches in standard units = (70 — 66)/5 = .80. 
Hence 

P(65 70) = P(—.20 ^ H* ^ .80) 

= .0793 + .2881 = .3674 
Thus N = 800(.3674) = 294. 

(ii) 72 inches in standard units — (72 — 66)/5 = 1.20. 
Hence 

P(H — 72) = P(H* — 1.2) 

- .5000 - .3849 = .1151 
Thus N = 800(.1151) - 92. 

Here H* is the standardized random variable 
corresponding to H and so H* has the standard 
normal distribution <p. 




NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION 

6.18. A fair coin is tossed 12 times. Determine the probability P that the number of heads 
occurring is between 4 and 7 inclusive by using (i) the binomial distribution, (ii) the 
normal approximation to the binomial distribution. 


(i) By Theorem 6.1 with n = 12 and p = q = ■£, 


P(4 heads) = ( l 4 2 ) (£)*<i) 8 = 

P(5 heads) = <“)(*>•(*)* = & 

W heads) = <“><*)•<*)• = 

heads) = <“)<*>*(*>• = Jg- 


Hence 


p _ 495 , 792 , 924 . 792 _ 3003 __ 

*■ 4096 4096 " 1_ 4096 4096 — 4096 ~ 


.7332. 
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Probability of the number of heads occurring. 


Here p = np = 12 • i — 6 and u = yjnpq — = 1.73. Let A' denote the number 

of heads occurring. We seek P(4 — X — 7). But if we assume the data is continuous, in order 
to apply the normal approximation, we must find P(3.5 - X ^ 7.5) as indicated in the above 
diagram. Now 

3.5 in standard units = (3.5 — 6)/1.73 = —1.45. 

7.5 in standard units = (7.5 — 6)/1.73 = .87. 

Then P ~ P(3.5 — X — 7.5) 

= P(—1.45 — X* — .87) 

= .4265 + .3078 = .7343 



6.19. A fair die is tossed 180 times. Find the probability P that the face 6 will appear 
(i) between 29 and 32 times inclusive, (ii) between 31 and 35 times inclusive. 

Here p = np = 180 • i — 30 and a — 's/npq — y /180 • i £ — 5. Let X denote the number of 
times the face 6 appears. 

(i) We seek P( 29 — X — 32) or, assuming the data is con¬ 
tinuous;, P(28.5 — X — 32.5). Now 

28.5 in standard units = (28.5 — 30)/5 = —.3 

32.5 in standard units = (32.5 — 30)/5 = .5 

Hence 

P ~ P(28.5 — X - 32.5) = P(—.3 — X* ~ .5) 

- P(—.3 ^ X* ^ 0) + P(0 — X* — .5) 

= .1179 + .1915 = .3094 

(ii) We seek P(31 — X — 35) or, assuming the data is continuous, P(30.5 — X — 35.5). Now 

30.5 in standard units = (30.5 — 30)/5 — .1 

35.5 in standard units = (35.5 — 30)/5 — 1.1 

Then 

P « P(30.5 — X — 35.5) = P(.l — X* — 1.1) 

= P(0 ^ X* ^ 1.1) - P(0 ^ X* ^ .1) 

- .3643 - .0398 = .3245 




6.20. Among 10,000 random digits, find the probability P that the digit 3 appears at most 
950 times. 

Here p — np — 10,000 • = 1000 and cr — yJnpq — \/l0,000 • ^ ^ — 30. Let X denote the 

number of times the digit 3 appears. We seek P(X — 950). Now 
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950 in standard units = (950 — 1000)/30 
= -1.67 

Thus P ~ P(X — 950) = P(X* — —1.67) 

= P(X* — 0) — P(—1.67 — X* — 0) 

= .5000 - .4525 = .0475 



POISSON DISTRIBUTION 
6.21. Find: (i) e -1 - 3 , (ii) e -2 - 5 . 

By Table 6.3, page 112, and the law of exponents: 

(i) e-i-3 = (e~i)(e~ 0 - 3 ) = (,368)(.741) = .273. 

(ii) e- 2 - 3 = (e-2)(e-o.«) = (,135)(.607) = .0819. 


X k e~ 


6.22. For the Poisson distribution p{k; X) = find (i) p( 2; 1), (ii) p{ 3; £), (iii) p( 2; .7). 

(Use Table 6.3, page 112, to obtain e ~ x .) 


(i) p(2; 1) = 

(ii) p( 3;|) = 

(iii) p(2; .7) = 


1%~1 

e- 1 _ .368 

2! “ 

2 ~ 2 _ 

(l) 3e -.5 

e-'« _ .607 

3! 

48 _ 48 

(.7) 2 e~- 7 

(.49)(.497) 

2! 

2 


= .013. 


= . 12 . 


6.23. Suppose 300 misprints are distributed randomly throughout a book of 500 pages. 
Find the probability P that a given page contains (i) exactly 2 misprints, (ii) 2 or 
more misprints. 

We view the number of misprints on one page as the number of successes in a sequence of 
Bernoulli trials. Here n = 300 since there are 300 misprints, and p = 1/500, the probability that 
a misprint appears on the given page. Since p is small, we use the Poisson approximation to the 
binomial distribution with \ = np = 0.6. 

(i) P = p{2; 0.6) = - = (,36)(.549)/2 = .0988 « .1. 

( fOO>>-0.6 

(ii) P(0 misprints) = - = e~ 0 Q = .549 

P(1 misprint) = °' 8 = (,6)(.549) = .329 

Then P = 1 — P(0 or 1 misprint) = 1 — (.549 + .329) = .122. 


6.24. 


Suppose 2% of the items made by a factory are defective. Find the probability P 
that there are 3 defective items in a sample of 100 items. 

The binomial distribution with n = 100 and p = .02 applies. However, since p is small, we 
use the Poisson approximation with X = np = 2. Thus 


P = p(3; 2) 


23^-2 


8(.136)/6 = .180 


3 ! 
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6.25. Show that the Poisson distribution p(k; A) is a probability distribution, i.e. 

oo 

2 p(fc; a) = l 

Jc = 0 

oo 

By known results of analysis, e x = 2 X k /k\. Hence 

k = 0 

2 P(k; X) = 2 -f, = 2 X fc /fc! = = 1 

fc—o k = 0 fc~0 


6.26. Prove Theorem 6.5: Let X be a random variable with the Poisson distribution 
p(k; A). Then (i) E(X) = A and (ii) Var (X) - A. Hence <r x = yT. 


(i) Using p{k\\) = \ k e~ x /k\, we obtain 


#(A) = 2 k • p(fc; X) = 2 & 


Jc —0 


fc = 0 


X fc e~ 

kl 


= x 5 


X^e-x 


L fc =i (/c — 1)! 

(we drop the term A; = 0 since its value is zero, and we factor out X from each term). Let 
s = k — 1 in the above sum. As k runs through the values 1 to °°, s runs through the values 
0 to oo. Thus 


E(X) = X 2 


X s e~ 


~ x 2 p($; x) = x 


since 2 P( s ; X) = 1 by the preceding problem. 

s = 0 

(ii) We first compute E(X 2 ): 

E(X 2) = 2 fc 2 p(fc; X) = 2 & 2 

fc =0 fc =0 

Again we let s = k — 1 and obtain 

tf(X 2 ) = X 2 (8 + 1) ,, 


s = 0 


xfce x = X 2 k Xk le X 


fc! 


k=i (*-D! 


\Sp—\ ™ 

= X 2 (s+l)p(s;X) 


But 


2 (s + 1) p(s; X) = 2 sp(s; X) + 2 p(s; X) = X + 1 

s = 0 s=0 s — 0 


where we use (i) to obtain X and the preceding problem to obtain 1. Accordingly, 

E(X 2 ) — X(X H - 1) — X^ *4“ X 

and Var (X) = E(X 2 ) - 4 = X* + X - X* = x 

Thus the theorem is proved. 


6.27. Show that if p is small and n is large, then the binomial distribution is approximated 
by the Poisson distribution; that is, b(k ; n,p) p(k; A) where A = np . 

We have 6(0; n,p) = (1 — p) n = (1 — X/ri) n . Taking the natural logarithm of both sides, 

In 6(0; n,p) = n In (1 — Xln) 

The Taylor expansion of the natural logarithm is 



and so 
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Therefore if n is large, 

In 6(0; n, p) = nln(l-^ = - * - £ -- "X 
and hence 6(0; n,p) « e~~ x . 

Furthermore if p is very small and hence q « 1, we have 

6(fc; n, p) (to - k + l)p X - (ft - jjp ^ X 

6(ft — 1;to,p) ~~ kq ~~ kq k 

That is, b(k;n,p) -rb(k—l;n,p). Thus using b(0;n,p) ~ e~ x , we obtain 6(1; to, p) » Xe _x , 
6(2; to, p) X 2 e _x /2 and, by induction, 6(ft; n, p) « - ^ ■ - = p(ft; X). 

MISCELLANEOUS PROBLEMS 

6.28. The painted light bulbs produced by a company are 50% red, 30% blue and 20% green. 
In a sample of 5 bulbs, find the probability P that 2 are red, 1 is green and 2 are blue. 

By Theorem 6.6 on the multinomial distribution, 

P = 271727 (-5)2 (.3) (.2)2 = .09 


6.29. Show that the normal distribution 

f(x) = 




q-Vz(x~ n) 2 /<r 2 


is a continuous probability distribution, i.e. [* f(x) dx = 1. 

— CO 

/ » 

fix) dx , we obtain the integral 

I = -j= f e ~ t%/2 dt 

\2tT J-CC 

It suffices to show that 7 2 = 1. We have 

1 f* °° S* 00 S* oO «5 

l z = 27 J e- t2 «dtj a'**' 2 ds = ~ j j e -<» ! + tb/2 

“ 00 —oo 00 — 00 

We introduce polar coordinates in the above double integral. Let 8 = r cos 0 and t = r sin t 
Then dsdt = rdrde, and 0 - e — 2tt and 0-r-«. That is, 


I 2 = 


1 s*2ir 

2 TJ o J o re-P^drde 


But f dr = e _r * /2 J = 1 

l c 2n 

Hence l 2 ~ — J de — 1 and the theorem is proved. 


6.30. Prove Theorem 6.3: Let X be a random variable with the normal distribution 


f(x) = 


T\/27r 


- Vzix—nr/a 3 


Then (i) E(X ) — ^ and (ii) Var (X) = o- 2 . Hence o x = <r. 

1 C 60 

(i) By definition, £?(X) = —— I dx. Setting i = (x - rf/a, we obtain 

tx\ 9.«r J — ~ 
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E(X) = -j= f 


(at + fj)e fZ/2 dt 


= — f 

>/£ J- 


_ - te * 12 dt -f n _ 

v2tt \2v 


h= fe- 

2,r 


™ dt 


J oo 

te~ t2/2 dt = 0. Further- 

more, —— ) 
a/2tt 
as claijned. 


e t z ' 2 dt — 1 by the preceding problem. Accordingly, E(X) — —• 0 + /x • 1 = fi 

3 y27r 


1 z' 00 

(ii) By definition, E(X 2 ) ~— -— I x 2 e^^ x ~~^ 2/a& dx, Again setting t — (x — ^)h f we obtain 

ay^TT - on 


tf(X 2 ) 




(af + /,)%-' a /2 dt 


i r* 


t 2 e -t 2 /2 dt + 2 M <t 


i_ r 

^ J- 




te-™ dt + n- 


?-}= f <-*» dt 


which reduces as above to E(X 2 ) = o 2 —C t 2 e -< 2 /2 ( it + ffi 

y[%r 


We integrate the above integral by parts. Let u ~ t and dv — te~ t2/2 dt. 
and du = dt. Thus 


-L f t 2 e^ 2 dt 

\2tir ** -oo 



—te-** 2 


+- 4 = r «= 

V 2tT ^ — CO 


Then v — —e~ t2/2 


0 + 1 = 1 


Consequently, E(X 2 ) ~ a 2 • 1 + /x 2 = a 2 + /x 2 and 

Var (X) = (X 2 ) - M 2 = a 2 + /x 2 - ^ 2 = <r 2 

Thus the theorem is proved. 


Supplementary Problems 

BINOMIAL DISTRIBUTION 

6.31. Find (i) 6(1; 5,1), (ii) 6(2; 7, i), (iii) 6(2; 4, J). 

6.32. A card is drawn and replaced three times from an ordinary deck of 52 cards. Find the probability 
that (i) two hearts are drawn, (ii) three hearts are drawn, (iii) at least one heart is drawn. 

6.33. A baseball player’s batting average is .300. If he comes to bat 4 times, what is the probability 
that he will get (i) two hits, (ii) at least one hit? 

6.34. A box contains 3 red marbles and 2 white marbles. A marble is drawn and replaced three times 

from the box. Find the probability that (i) 1 red marble was drawn, (ii) 2 red marbles were 

drawn, (iii) at least one red marble was drawn. 

6.35. Team A has probability J of winning whenever it plays. If A plays 4 games, find the probability 

that A wins (i) 2 games, (ii) at least 1 game, (iii) more than half of the games. 

6.36. A card is drawn and replaced in an ordinary deck of 52 cards. How many times must a card be 
drawn so that (i) there is at least an even chance of drawing a heart, (ii) the probability of 
drawing a heart is greater than £? 
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6.37. The probability of a man hitting a target is (i) If he fires 5 times, what is the probability of 
hitting the target at least twice? (ii) How many times must he fire so that the probability of 
hitting the target at least once is more than 90%. 

6.38. The mathematics department has 8 graduate assistants who are assigned to the same office. Each 
assistant is just as likely to study at home as in the office. How many desks must there be in the 
office so that each assistant has a desk at least 90% of the time? 

6.39. Of the bolts produced by a factory, 2% are defective. In a shipment of 3600 bolts from the factory, 
find the expected number of defective bolts and the standard deviation. 

6.40. A fair die is tossed 1620 times. Find the expected number of times the face 6 occurs and the 
standard deviation. 

6.41. Let X be a binomially distributed random variable with E(X) = 2 and Var (X) = 4/3. Find the 
distribution of X. 

6.42. Consider the binomial distribution P{k) — b(k ; n, p). Show that: 

(\\ - „ &(fc; n, p) (n-k + 1 )p 

P(k — 1) b(k — 1; n, p) kq 

(ii) P(k) > P(k - 1) for k < (n + l)p and P(k) < P(k - 1) for k > (w + l)p. 

NORMAL DISTRIBUTION 

6.43. Let 0 be the standard normal distribution. 

(i) Find <f>( J), 0(^) and 0(-f). 

(ii) Find t such that (a) 0(i) = .100, ( b ) 0(t) = .2500, (c) 0(e) = .4500. 

6.44. Let A be a random variable with the standard normal distribution 0. Find: 

(i) P(-.81 ~X— 1.13), (ii) P(.53 — X — 2.03), (iii) P(X .73), (j v ) p(\x\ =£ i). 

6.45. Let X be normally distributed with mean 8 and standard deviation 4. Find: 

(i) P(5 — X — 10), (ii) P(10 — X — 15), (iii) P(X — 15), (iv) P(A=£ 5). 

6.46. Suppose the weights of 2000 male students are normally distributed with mean 155 pounds and 

standard deviation 20 pounds. Find the number of students with weights (i) less than or equal to 

100 pounds, (11) between 120 and 130 pounds, (iii) between 150 and 175 pounds, (iv) greater than 
or equal to 200 pounds. 

6.47. Suppose the diameters of bolts manufactured by a company are normally distributed with mean 

.25 inches and standard deviation .02 inches. A bolt is considered defective if its diameter is 
- .20 inches or - .28 inches. Find the percentage of defective bolts manufactured by the company. 

6.48. Suppose the scores on an examination are normally distributed with mean 76 and standard deviation 

15. The top 15% of the students receive A’s and the bottom 10% receive F’s. Find (i) the minimum 
score to receive an A and (ii) the minimum score to pass (not to receive an F). 

NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION 

6.49. A fair coin is tossed 10 times. Find the probability of obtaining between 4 and 7 heads inclusive 
by using (i) the binomial distribution, (ii) the normal approximation to the binomial distribution. 

6.50. A fair coin is tossed 400 times. Find the probability that the number of heads which occur differs 
from 200 by (i) more than 10, (ii) more than 25. 

6.51. A fair die is tossed 720 times. Find the probability that the face 6 will occur (i) between 100 and 
125 times inclusive, (ii) more than 150 times. 

6.52. Among 625 random digits, find the probability that the digit 7 appears (i) between 50 and 60 times 
inclusive, (ii) between 60 and 70 times inclusive. 
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POISSON DISTRIBUTION 

6.53. Find (i) e~^ t (ii) 

6.54. For the Poisson distribution p(k; \), find (i) p( 2; 1.5), (ii) p( 3; 1), (iii) p( 2; .6). 

6.55. Suppose 220 misprints are distributed randomly throughout a book of 200 pages. Find the probability 
that a given page contains (i) no misprints, (ii) 1 misprint, (iii) 2 misprints, (iv) 2 or more 
misprints. 

6.56. Suppose 1% of the items made by a machine are defective. Find the probability that 3 or more items 
are defective in a sample of 100 items. 

6.57. Suppose 2% of the people on the average are left-handed. Find the probability of 3 or more left- 
handed among 100 people. 

6.58. Suppose there is an average of 2 suicides per year per 50,000 population. In a city of 100,000 find 
the probability that in a given year there are (i) 0, (ii) 1, (iii) 2, (iv) 2 or more suicides. 


MULTINOMIAL DISTRIBUTION 

6.59. A die is “loaded” so that the face 6 appears .3 of the time, the opposite face 1 appears .1 of the 
time, and each of the other faces appears .15 of the time. The die is tossed 6 times. Find the 
probability that (i) each face appears once, (ii) the faces 4, 5 and 6 each appears twice. 

6.60. A box contains 5 red, 3 white and 2 blue marbles. A sample of six marbles is drawn with replace¬ 
ment, i.e. each marble is replaced before the next one is drawn. Find the probability that: (i) 3 are 
red, 2 are white and 1 is blue; (ii) 2 are red, 3 are white and 1 is blue; (iii) 2 of each color appears. 


Answers to Supplementary Problems 


6.31. 

(i) 

80 
243 J 

, (H) 


(iii) 

27 

128 

6.32. 

(i) 

9 

64’ 

(ii) 

A’ 

(iii) 

3 7 
64 

6.33. 

(i) 

0.2646, 

(ii) 

0.7599 

6.34. 

(i) 

36 
125 ; 

. (ii) 

54 

12|5 ’ 

(iii) 

117 

125 

6.35. 

(i) 

216 

625 

, (ii) 

544 

625’ 

(iii) 

112 

625 

6.36. 

(i) 

3, 

(ii) 5 




6.37. 

(i) 

131 

243 

, (ii) 

6 




6.38. 6 

6.39. fi = 72, a =t 8.4 

6.40. n = 270, a =t 16 



0 

1 

2 

3 

4 

5 

6 

/(* i) 

64/729 

192/729 

240/729 

160/729 

60/729 

12/729 

1/729 


Distribution of X with n — 6 and p — 1/3 
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6.43. (i) 0(f) = .3867, 0(f) = .3621, 0(-£) = .3011. 

(ii) (a) t — ±1.66, (6) t = ±.97, (c) there is no value of t. 

6.44. (i) .2910 + .3708 = .6618, (ii) .4788 - .2019 = .2769, (iii) .5000 + .2673 = .7673, (iv) 2(.0987) = .1974. 

6.45. (i) .4649, (ii) .2684, (iii) .0401, (iv) .2266 

6.46. (i) 6, (ii) 131, (iii) 880, (iv) 24 

6.47. 7.3% 

6.48. (i) 92, (ii) 67 

6.49. (i) .7734, (ii) .7718 

6.50. (i) .2938, (ii) .0108 

6.51. (i) .6886, (ii) .0011 

6.52. (i) .3518, (ii) .5131 

6.53. (i) .202, (ii) .100 

6.54. (i) .251, (ii) .0613, (iii) .0988 

6.55. (i) .333, (ii) .366, (iii) .201, (iv) .301 

6.56. 0.080 

6.57. 0.325 

6.58. (i) .0183, (ii) .0732, (iii) .1464, (iv) .909 

6.59. (i) .0109, (ii) .00103 

6.60. (i) .135, (ii) .0810, (iii) .0810 




Chapter 7 


Markov Chains 


INTRODUCTION 

We review the definitions and elementary properties of vectors and matrices which are 
required for this chapter. 

By a vector u we simply mean an n-tuple of numbers: 

U = (Ul, U 2 , . . . , Un) 

The Ui are called the components of u . If all the m = 0, then u is called the zero vector . By 
a scalar multiple ku of u (where k is a real number), we mean the vector obtained from u 
by multiplying its components by k: 

ku ~ ( kui, ku 2f .. ., ku n ) 

We note that two vectors are equal if and only if their corresponding components are 
equal. 

By a matrix A we mean a rectangular array of numbers: 

ttll &12 * * * tti n 

&21 &22 * * * &2n 

a m 2 * * • a r nn 

The m horizontal n-tuples 

(ail, tti2, . . ., (Xin), (&21, ^22, . • . , Ci2n)f . - • , (&ml, Ctm2, • • . > ^mn) 

are called the rows of A, and the n vertical m-tuples 

i an \ 

a 2 i 

^ Uml I 

its columns. Note that the element a y, called the ij-entry, appears in the ith row and the 
yth column. We frequently denote such a matrix simply by A = (ay). 

A matrix with m rows and n columns is said to be an m by n matrix, written mxn 
matrix; if m = tt, then it is called a square matrix (or: n-square matrix). We also note 
that a matrix with only one row may be viewed as a vector, and vice versa. 

Now suppose A and B are two matrices such that the number of columns of A is equal 
to the number of| rows of B, say A is an m X p matrix and B is a p x n matrix. Then the 
product of A and B, written AB, is the m X n matrix whose ij-e ntry is obtained by multiply¬ 
ing the elements of the ith row of A by the corresponding elements of the yth column of B 
and then adding: 


aw ^ 


1 din ^ 

a 2 2 

| 9 • • • y 

a2n ^ 

a m2 i 


lamn I 
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an 


a ip 

dn • ' * dip 

dm 1 

. . . 



\r 


bn 


bm\ i 

: 

... 

.. 


) l 

/ \6pi 


b Pi 


bp nj 


where 


Cij — Oii&ij "1“ (Li 2&2j “1“ * “1“ (Lipbpj — (Likbkj 


If the number of columns of A is not equal to the number of rows of B, say A is m X p 
and B is q X n where p ¥* q, then the product AB is not defined. 

There are special cases of matrix multiplication which are of special interest. If A 
is an n-square matrix, then we can form all the powers of A: 

A 2 = AA, A 3 = AA 2 , A 4 = AA 3 , ... 

In addition, if u is a vector with n components, then we can form the product 

uA 

which is again a vector with n components. We call u ¥* 0 a fixed vector (or: fixed point) 
of A, if u is “left fixed”, i.e. is not changed, when multiplied by A: 

uA — u 

In this case, for any scalar k ¥* 0, we have 

( ku)A = k(uA) — ku 

That is, 

Theorem 7.1: If u is a fixed vector of a matrix A, then every nonzero scalar multiple ku 
of u is also a fixed vector of A. 


Example 7.1: 


r s\ia x a 2 a 3 
t w JUi b 2 b 3 


ra x 4* sbj ra 2 + sb 2 ra 3 + sb 3 
ta x + ub x ta 2 + ub 2 ta 3 -f ub 3 


Example 7.2: If A = ^ 

A 2 = 


1 2 ) 

3 4 / * 


1 2\/l 2 

3 4 / V 3 4 


1+6 2+8 
3 + 12 6 + 16 


7 10 

15 22 


1 2 3 

Example 7.3: (1,2,3) 4 5 6 

\7 8 9 


= (1 + 8 + 21,2 + 10 + 24,3 + 12 + 27) = (30,36,42) 


Example 7.4: Consider the matrix A = 
of A. For, 


A — (2, —1) 


. Then the vector u = (2, —1) is a fixed point 


= (2 • 2 - 1 • 2, 2 • 1 - 1 • 3) = (2,-1) = u 


Thus by the above theorem, the vector 2 u = (4, —2) is also a fixed point of A: 

.. „ /2 l\ ......... 


(4, -2) 


(4 • 2 - 2 • 2, 4 • 1 - 2 • 3) = (4,-2) 


PROBABILITY VECTORS, STOCHASTIC MATRICES 

A vector u = (ui, U 2 , ..., it«) is called a 'probability vector if the components are non¬ 
negative and their sum is 1. 
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Example 7.5: 


Consider the following vectors: 

u = (f > 0, %), v = (f, 0, £) and w = (J, 0, £) 

Then: 

u is not a probability vector since its third component is negative; 
v is not a probability vector since the sum of its components is greater than 1; 
w is a probability vector. 


Example 7.6: The nonzero vector v = (2,3, 5, 0,1) is not a probability vector since the sum of 
its components is 2 + 3 + 5 + 0 + 1 = 11. However, since the components of v 
are nonnegative, v has a unique scalar multiple \v which is a probability vector; 
it can be obtained from v by multiplying each component of v by the reciprocal 
of the sum of the components of v : = (h» n> n f If)* 

A square matrix P = (pa) is called a stochastic matrix if each of its rows is a probability 
vector, i.e. if each entry of P is nonnegative and the sum of the entries in each row is 1. 


Example 7.7: 


Consider the following matrices: 

(\ 0 *\ 

I \ -i 

\i i \1 

(i) 



(ii) 


v /° 

i 

°\ 

) w 

* 

* 

’ U 

« 

0/ 


(ni) 



(i) is not a stochastic matrix since the entry in the second row and third column 
is negative; 

(ii) is not a stochastic matrix since the sum of the entries in the second row is 
not 1; 

(iii) is a stochastic matrix since each row is a probability vector. 


We shall prove (see Problem 7.10) 

Theorem 7.2: If A and B are stochastic matrices, then the product AB is a stochastic 
matrix. Therefore, in particular, all powers A n are stochastic matrices. 


REGULAR STOCHASTIC MATRICES 

We now define an important class of stochastic matrices whose properties shall be 
investigated subsequently. 

Definition: A stochastic matrix P is said to be regular if all the entries of some power 
P m are positive. 


Example 7.8: The stochastic matrix A — 


A* = 

is positive in every entry. 



is regular since 



Example 7.9: Consider the stochastic matrix A 



Here 



In fact every power A m will have 1 and 0 in the first row; hence A is not regular. 
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FIXED POINTS AND REGULAR STOCHASTIC MATRICES 

The fundamental property of regular stochastic matrices is contained in the following 
theorem whose proof lies beyond the scope of this text. 

Theorem 7.3: Let P be a regular stochastic matrix. Then: 

(i) P has a unique fixed probability vector t, and the components of t 
are all positive; 

(ii) the sequence P, P 2 , P 3 ,... of powers of P approaches the matrix T 
whose rows are each the fixed point t; 

(iii) P is any probability vector, then the sequence of vectors 
pP, pP 2 , pP 3 , ... approaches the fixed point t. 

Note: P" approaches T means that each entry of P" approaches the corresponding 
entry of T, and pP n approaches t means that each component of pP n approaches the cor¬ 
responding component of t. 


Example 7.10: Consider the regular stochastic matrix P = ^ . We seek a probability 

vector with two components, which we can denote by t = (x, 1 - x), such that 

, f° l\ 

(*, i-*)^ J = (*, 1-x) 

Multiplying the left side of the above matrix equation, we obtain 

• — = * 


(i — h x ’ i + i x ) = (*. 1 — x) or 


i + £* = i 


or x = i 


Thus t - (1, 1 - i) _ (1, |) is the unique fixed probability vector of P. By 
Theorem 7.3, the sequence P, P 2 , P 3 , . .. approaches the matrix T whose rows are 
each the vector t: 

r = /i l\ _ /.33 .67' 

U 1/ V33 .67 

We exhibit some of the powers of P to indicate the above result: 


P2 - 


i t 


/.50 .50 
\.25 .75 


p3 


i t 


.25 .75 

.37 .63 


p4 


^ ^ ^ ,63> \ * ps = /i 5 e ie\ = /.; 

A **/ V3i m )’ Vh h ) 0 


_ /.31 .69 
34 .66 


Example 7.11: Find the unique fixed probability vector of the regular stochastic matrix 



P = 


Method 1. We seek a probability vector with three components, which we can 
represent by t - (x, y t l-x-y) t such that tP = t: 

fo 1 o y 

(x, y, 1 — x — y) I 0 0 1 ] = (x, y, 1 — x — y) 

\i i 0/ 
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Multiplying the left side of the above matrix equation and then setting correspond¬ 
ing components equal to each other, we obtain the system 

i — \y ~ x % x + y ■= i 

x — l 

° r x 3 2 / — —i or ° 

y — 

y = 1 — x — y x + 2y = 1 

Thus t — (l, |) is the unique fixed probability vector of P. 


Method 2. We first seek 

any fixed vector u 

= (*, V, «) 

of the matrix P: 

1° 1 

°\ 


f ¥ = x 

(x, y,z) 0 0 

1 = (*, y, z) 

or 

= y 

W i 

0/ 


[ V = z 


We know that the system has a nonzero solution; hence we can arbitrarily assign 
a value to one of the unknowns. Set z = 2. Then by the first equation x — 1, 
and by the third equation y = 2. Thus u = (1, 2, 2) is a fixed point of P. But 
every multiple of u is a fixed point of P; hence multiply u by ± to obtain the re¬ 
quired fixed probability vector t = ±u — (^, §, f). 


MARKOV CHAINS 

We now consider a sequence of trials whose outcomes, say, Xi,X 2 , ...» satisfy the 
following two properties: 

(i) Each outcome belongs to a finite set of outcomes {ai,a 2 , .. .,Om} called the state space 
of the system; if the outcome on the nth trial is ai, then we say that the system is in 
state Oi at tame n or at the nth step. 

(ii) The outcome of any trial depends at most upon the outcome of the immediately pre¬ 
ceding trial and not upon any other previous outcome; with each pair of states ( a if aj) 
there is given the probability Pa that a$ occurs immediately after a { occurs. 

Such a stochastic process is called a (finite) Markov chain . The numbers Pij, called the 
transition probabilities, can be arranged in a matrix 


Pu 

P 12 

' * Plm \ 

P 21 

P22 ‘ ‘ 

’ * P2m j 

Pml 

Pm2 

‘ * pmm I 


called the transition matrix . 

Thus with each state a { there corresponds the ith row (pn, P&, ..., Pim) of the transition 
matrix P ; if the system is in state Oi, then this row vector represents the probabilities of 
all the possible outcomes of the next trial and so it is a probability vector. Accordingly, 

Theorem 7.4: The transition matrix P of a Markov chain is a stochastic matrix. 

Example 7.12: A man either drives his car or takes a train to work each day. Suppose he never 
takes the train two days in a row; but if he drives to work, then the next day 
he is just as likely to drive again as he is to take the train. 

The state space of the system is {t (train), d (drive)}. This stochastic process 
is a Markov chain since the outcome on any day depends only on what happened 
the preceding day. The transition matrix of the Markov chain is 


t d 
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The first row of the matrix corresponds to the fact that he never takes the 
train two days in a row and so he definitely will drive the day after he takes the 
train. The second row of the matrix corresponds to the fact that the day after 
he drives he will drive or take the train with equal probability. 

Example 7.13: Three boys A, B and C are throwing a ball to each other. A always throws the ball 
to B and B always throws the ball to C ; but C is just as likely to throw the ball 
to B as to A. Let X n denote the nth person to be thrown the ball. The state space 
of the system is {A,B,C}. This is a Markov chain since the person throwing the 
ball is not influenced by those who previously had the ball. The transition matrix 
of the Markov chain is 

ABC 

A 1 0 1 0\ 

B O 0 1 

C U i 0/ 

The first row of the matrix corresponds to the fact that A always throws the 
ball to B . The second row corresponds to the fact that B always throws the ball 
to C. The last row corresponds to the fact that C throws the ball to A or B with 
equal probability (and does not throw it to himself). 


Example 7.14: A school contains 200 boys and 150 girls. One student is selected after another 
to take an eye examination. Let X n denote the sex of the Tith student who takes 
the examination. The state space of the stochastic process is (m (male), /(female)}. 
However, this process is not a Markov chain since, for example, the probability 
that the third person is a girl depends not only on the outcome of the second trial 
but on both the first and second trials. 


Example 7.15: 


(Random walk with reflecting barriers.) A man is at an integral point on the 
x-axis between the origin O and, say, the point 5. He takes a unit step to the 
right with probability p or to the left with probability q — 1 — p, unless he is at 
the origin where he takes a step to the right to 1 or at the point 5 where he takes 
a step to the left to 4. Let X n denote his position after n steps. This is a Markov 


point i. 


P = 


a 2> a 3> a 4> 

a 5 } where 

a i 

means 

x is 







a 0 

«i 

a 2 

a 3 

a 4 

a o 

/» 

1 

0 

0 

0 

ai 

1 < 

0 

V 

0 

0 

a 2 

0 

Q 

0 

V 

0 

a S 

l 0 

0 

7 

0 

P 

a 4 

\ 0 

0 

0 

<7 

0 

«5 

\o 

0 

0 

0 

1 


a 5 
0 \ 
0 
0 
0 
p 

0 , 


Each row of the matrix, except the first and last, corresponds to the fact that 
the man moves from state a { to state a {+1 with probability p or back to state Q>i-\ 
with probability q — 1 — p. The first row corresponds to the fact that the man 
must move from state a 0 to state a lf and the last row that the man must move 
from state a 5 to state a 4 . 


HIGHER TRANSITION PROBABILITIES 

The entry Pa in the transition matrix P of a Markov chain is the probability that the 
system changes from the state cu to the state dj in one step: a\ a,. Question: What is 
the probability, denoted by p^\ that the system changes from the state (h to the state a, in 
exactly n steps: 

<h -> a kl -> a k2 •+ • * • a kn _ 1 -* a. 
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The next theorem answers this question; here the p^ n) are arranged in a matrix P <n) called 
the n-step transition matrix: 

Theorem 7.5: Let P be the transition matrix of a Markov chain process. Then the n-step 
transition matrix is equal to the nth power of P; that is, P (n) = P n . 

Now suppose that, at some arbitrary time, the probability that the system is in state 
di is pr, we denote these probabilities by the probability vector p = (pi, p 2 , ..., p m ) which 
is called the probability distribution of the system at that time. In particular, we shall let 

pCO) = (pCO> ; p(0) > ..., p(0>) 

denote the initial probability distribution, i.e. the distribution when the process begins, 
and we shall let 

P (B) = (P 1 (n) , P?\ ....Pm 5 ) 

denote the nth step probability distribution, i.e. the distribution after the first n steps. 
The following theorem applies. 


Theorem 7.6: Let P be the transition matrix of a Markov chain process. If p = (pi) is 
the probability distribution of the system at some arbitrary time, then 
pP is the probability distribution of the system one step later and pP n is 
the probability distribution of the system n steps later. In particular, 

p ai = p (oy P, p w = p w P, p (3) = p m P, . .., pM — pWpn 


Example 7.16: Consider the Markov chain of Example 7.12 whose transition matrix is 


P 


t 

d 


t d 



Here t is the state of taking a train to work and d of driving to work, 
ample 7.8, 


P4 — p2 . p2 




By Ex- 


Thus the probability that the system changes from, say, state t to state d in exactly 
4 steps is §, i.e. p ( <« = f. Similarly, p< 4 > = f, p< t 4) = f\- and = fl. 


Now suppose that on the first day of work, the man tossed a fair die and drove 
to work if and only if a 6 appeared. In other words, p (0) = (§, -jl) is the initial 
probability distribution. Then 


p< 4 > = P (0) ^ 4 - (f, i) 


£ § 
5 iW » ® 

JL 11 
16 16 


— 61 v 

UK' 


is the probability distribution after 4 days, i.e. p ( { 4) — jg and p_ 


(4) = 


Example 7.17: Consider the Markov chain of Example 7.13 whose transition matrix is 

ABC 
A /0 1 0\ 

P = B I 0 0 1 

C \* i 0/ 
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Suppose C was the first person with the ball, i.e. suppose p (0) = (0,0,1) is the 
initial probability distribution. Then 


r 0 1 O' 

r)(l> = p«»p = (0, 0, 1) I 0 0 1 


i °J 


f 0 1 O' 

d( 2) = pd)p - (£, 0) I 0 0 1 


P (3) = p(2)p 3 = (0, |) 


U * 0/ 

/o 1 (0 
0 0 1 
U i °y 


= (*■ 0) 


- to,*.*) 




Thus, after three throws, the probability that A has the ball is J, that B has the 
ball is l and that C has the ball is pj 3) = 1, p^ 3 > = | and p£ 3) = 


Example 7.18: Consider the random walk problem of Example 7.15. Suppose the man began at the 
point 2; find the probability distribution after 3 steps and after 4 steps, i.e. p ^ 3 > 
and p (4) . 

Now p (0) = (0,0,1, 0, 0,0) is the initial probability distribution. Then 
p(i) = p<o>p — (o, q, 0, p, 0, 0) 

p(2) — pU)p = (q2 f o, 2 pq, 0, p 2 , 0) 

p(3) = p(2)p = (0, g 2 + 2pg 2 , 0, 3p 2 g, 0, p 3 ) 

p (4) = p<3>P ~ (^3 + 2p^ 3 , 0, pq 2 + 5p 2 ^ 2 , 0, 3p 3 qr + p 3 , 0) 

Thus after 4 steps he is at, say, the origin with probability q 3 4* 2 pq 3 . 


STATIONARY DISTRIBUTION OF REGULAR MARKOV CHAINS 

Suppose that a Markov chain is regular, i.e. that its transition matrix P is regular. 
By Theorem 7.3 the sequence of w-step transition matrices P n approaches the matrix T 
whose rows are each the unique fixed probability vector t of P; hence the probability pf. n) 
that cij occurs for sufficiently large n is independent of the original state and it approaches 
the component t, of t. In other words, 

Theorem 7.7: Let the transition matrix P of a Markov chain be regular. Then, in the 
long run, the probability that any state cq occurs is approximately equal 
to the component tj of the unique fixed probability vector t of P. 

Thus we see that the effect of the initial state or the initial probability distribution of 
the process wears off a$ the number of steps of the process increase. Furthermore, every 
sequence of probability distributions approaches the fixed probability vector t of P, called 
the stationary distribution of the Markov chain. 

Example 7.19: Consider the Markov chain process of Example 7.12 whose transition matrix is 

t d 
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By Example 7.10, the unique fixed probability vector of the above matrix is (£, ■§). 
Thus, in the long run, the man will take the train to work of the time, and drive 
to work the other | of the time. 


Example 7.20: Consider the Markov chain process of Example 7.13 whose transition matrix is 

ABC 
A /o 1 o\ 

P = BOO 1 

C U i 0 ) 


By Example 7.11, the unique fixed probability vector of the above matrix is 
i’ f )• Thus, in the long run, A will be thrown the ball 20% of the time, and 
B and C 40% of the time. 


ABSORBING STATES 

A state Oi of a Markov chain is called absorbing if the system remains in the state a* 
once it enters there. Thus a state is absorbing if and only if the ith row of the transition 
matrix P has a 1 on the main diagonal and zeros everywhere else. (The main diagonal of 
an w-square matrix A = (ay) consists of the entries an, a 2 2 , . . ., a nn .) 


Example 7^21: 


Suppose the following matrix is the transition matrix of a Markov chain: 


cti &2 U3 C &4 0-5 


P 


/* 

o 

1 

2 

\° 

\o 


“i /i 
a 2 
a 3 
a 4 
«5 


0 

1 

0 

1 

0 


1 i\ 

0 0 ' 

i o 
0 0 
o 1, 


The states a 2 and a 5 are each absorbing, since each of the second and fifth rows 
has a 1 on the main diagonal. 


Example 7.22: (Random walk with absorbing barriers.) Consider the random walk problem of 

Example 7.15, except now we assume that the man remains at either endpoint when¬ 
ever he reaches there. This is also a Markov chain and the transition matrix is 
given by 

CIq a 3 a 4 a 5 

a 0 y;i; 0 0 0 0 0 \ 

cq / q 0 p 0 0 0\ 

p = a 2 i0q0p00\ 

a 3 I 0 0 q 0 p 0 I 

a 4 \o 0 0 q 0 p 

a 5 \ 0 0 0 0 0 1 / 

We call this process a random walk with absorbing barriers, since the a 0 and 
a 5 are absorbing states. In this case, denotes the probability that the man 
reaches the state a 0 on or before the rcth step. Similarly, denotes the proba¬ 
bility that he reaches the state a 5 on or before the nth step. 


Example 7.23: A player has, say, x dollars. He bets one dollar at a time and wins with probability 
p and loses with probability q — 1 — p. The game ends when he loses all his 
money, i.e. has 0 dollars, or when he wins IV — x dollars, i.e. has N dollars. This 
game is identical to the random walk of the preceding example except that here 
the absorbing barriers are at 0 and N. 
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Example 7.24: A man tosses a fair coin until 3 heads occur in a row. Let X n = k if, at the nth 
trial, the last tail occurred at the (n - k )-th trial, i.e. X n denotes the longest string 
of heads ending at the nth trial. This is a Markov chain process with state space 
{a 0 , a u a 2 y a 3 }, where a { means the string of heads has length L The transition 
matrix is 

a Q ttj d 2 ftg 

a 0 /* i 0 0\ 

«i | i 0 i 0 

“2 1 i 0 0 i 

a 3 \0 0 0 1/ 

Each row, except the last, corresponds to the fact that a string of heads is 
either broken if a tail occurs or is extended by one if a head occurs. The last 
line corresponds to the fact that the game ends if three heads are tossed in a row. 
Note that a 3 is an absorbing state. 

Let (h be an absorbing state of a Markov chain with transition matrix P. Then, for 
j ¥* i, the n-step transition probability p^ n) — 0 for every n. Accordingly, every power of 
P has a zero entry and so P is not regular. Thus: 

Theorem 7.8: If a stochastic matrix P has a 1 on the main diagonal, then P is not regular 
(unless P is a 1 x 1 matrix). 


Solved Problems 

MATRIX MULTIPLICATION 

/I 3 -1 

7.1. Let u = (1, —2,4) and A = ( 0 2 5 

\4 1 6 

The product of the vector u with 3 components by the 3X3 matrix A is again a vector with 3 
components. To obtain the first component of uA, multiply the elements of u by the corresponding 
elements of the first column of A and then add: 

j 1 3 -1\ 

( 1, -2, 4 ) 0 2 5 = (1 -1 + (-2)* 0 + 4- 4, , ) = (17, , ) 

\ 4 1 6/ 

To obtain the second component of uA, multiply the elements of u by the corresponding elements of 
the second column of A and then add: 

1 1 3 -A 

( Si -2, 4 ) 0 2 5 = (17, 1 • 3 + (-2) -2 + 4-1, ) = (17, 3, ) 

\4 1 «/ 

To obtain the third component of uA y multiply the elements of u by the corresponding elements of 
the third column of A and then add: 

( 1 3 "I \ 

( 1, -2, 4 ) 0 2 5 = (17, 3, 1 • (-1) + (-2) -5 + 4-6) (17, 3, 13) 

\4 1 6 / 


Find uA. 


That is, 


uA = (17, 3, 13) 
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7.2. Let A = 


2 -1 


and B = 


0 -4 

-2 6 


Find (i) AB, (ii) BA. 


(i) 


Since A is 2 X 2 and B is 2X3, the product AB is a 2 X 3 matrix. To obtain the first row of 
AB, multiply the elements of the first row (1, 3) of A by the corresponding elements of each of 


the columns 


+ 3 

2 -1 


(-“) and ( 


-4 

6 


of B and then add: 


3 -2 


1 • 2 + 3 • 3 1*0 + 3* (-2) 1 • (-4) + 3 


• 6 ^ _ ^11 -6 14 ^ 


To obtain the second row of AB, multiply the elements of the second row (2, —1) of A by the 
corresponding elements of each of the columns of B and then add: 


1 3 

2 4 


2 0-4 

3 —2 6 

11 -6 14 

2*2 + (-1) *3 2*0 + (-1) • (-2) 2 • (-4) + (-1) • 6 


11 -6 

1 2 


!4\ 

-14/ 


Thus 


AB 


11 -6 14 

1 2 -14 


(ii) Note B is 2 X 3 and A is 2 X 2. Since the “inner numbers” 3 and 2 are not equal, i.e. the num¬ 
ber of columns of B is not equal to the number of rows of A, the product BA is not defined. 


7.3. Let A 

(i) A2 


1 2 
4 -3 

— AA — 


Find (i) A 2 , (ii) A 3 . 


l 2 
4 -3 


1 2 

4 -3 


(ii) A3 = AA2 = 


1 • 1 + 2 * 4 1*2 + 2* (-3) 

4*1 + (-3) *4 4*2 + (-3) * (-3) 

1 2\/ 9 -4 

4 -3/\-8 17 

1*9 + 2* (-8) 1 • (-4) + 2 • 17 

4*9 + (-3) • (-8) 4 • (-4) + (-3) • 17 


9 -4 

-8 17 


-7 

60 


30 

-67 


PROBABILITY VECTORS AND STOCHASTIC MATRICES 
7.4. Which vectors are probability vectors? 

(i) u = a, 0, -b h i), (ii) V = a 0, h b i), (iii) w = (i, 0, 0, b b- 

A vector is a probability vector if its components are nonnegative and their sum is 1. 

(i) u is not a probability vector since its third component is negative. 

(ii) v is not a probability vector since the sum of the components is greater than 1. 

(iii) w is a probability vector since the components are nonnegative and their sum is 1. 


7.5. Multiply each vector by the appropriate scalar to form a probability vector: 

(i) (2, 1, 0, 2|, 3), (ii) (4, 0, 1, 2, 0, 5), (iii) (3, 0, -2, 1), (iv) (0, 0, 0, 0, 0). 

(i) The sum of the components is 2 + l + 0 + 3 + 2 = 8; hence multiply the vector, i.e. each 
component, by ^ to obtain the probability vector (£, 0, §). 
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(ii) The sum of the components is 4 + 0 + 1 + 2 + 04-5 — 12; hence multiply the vector, i.e. each 
component, by ^ * 0 obtain the probability vector (i' T2> 6' 

(iii) The first component is positive and the third is negative; hence it is impossible to multiply the 
vector by a scalar to form a vector with nonnegative components. Thus no scalar multiple 
of the vector is a probability vector. 

(iv) Every scalar multiple of the zero vector is the zero vector whose components add to 0. Thus 
no multiple of the zero vector is a probability vector. 


7.6. Find a multiple of each vector which is a probability vector: 

(i) (h f, 0, 2, *), (ii) (0, t, 1, t, f). 

In each case, first multiply each vector by a scalar so that the fractions are eliminated. 

(i) First multiply the vector by 6 to obtain (3, 4, 0,12, 5). Then multiply by 1/(3 + 4 + 0 + 12 + 5) = 

to obtain (1,^,0, 1,^) which is a probability vector. 

(ii) First multiply the vector by 30 to obtain (0, 20, 30,18, 25). Then multiply by 1/(0 + 20 + 30 + 
18 + 25) = ^ to obtain (0, §§> ff> §§’ If) w hich is a probability vector. 


7.7. 


Which of the following matrices are stochastic matrices? 

i i\ /rf Te\ (l 0 

o 1) M B = [z *) = ( 


(i) A = 


(iv) D 




(i) A is not a stochastic matrix since it is not a square matrix. 

(ii) B is not a stochastic matrix since the sum of the components in the last row is greater than 1. 

(iii) C is a stochastic matrix. 

(iv) D is not a stochastic matrix since the entry in the first row, second column is negative. 


7.8. 


Let A = 


b i cA 

b 2 c 2 \ be a stochastic matrix and let n = (ui, u 2 , us) be a proba- 
b 3 cj 


bility vector. Show that uA is also a probability vector. 


Ih ci\ 

a A = {n u u 2y u z ) a 2 b 2 c 2 
\ a 3 ^3 c sJ 


(u 1 a 1 + u 2 a 2 + u z a Zy u 1 b 1 + u 2 b 2 + u z b Zy u 1 c 1 + u 2 c 2 + u z c z ) 


Since the u iy a it b { and c { are nonnegative and since the products and sums of nonnegative 
numbers are nonnegative, the components of uA are nonnegative as required. Thus we only need 
to show that the sum of the components of uA is 1. Here we use the fact that + u 2 + u Zf 
a i + &i + c i> a 2 + &2 + c 2 an( i a z + ^3 + c 3 are each 1: 

+ U 2 d 2 + u 3 a 3 + u \b i + u 2 b 2 + ^3^3 + u±Ci + u 2 c 2 + u z c z 
— u 1 (a 1 -\-b l + Ci) + u 2 {a 2 + b 2 + c 2 ) + w 3 (a 3 + b z + c 3 ) 


— Ui * 1 + u 2 ■ 1 + u z • 1 = u x + u 2 + u z = 1 


7.9. Prove: If A — (da) is a stochastic matrix of order n and u — (u\, u 2 , . . ., u n ) is a 
probability vector, then uA is also a probability vector. 

The proof is similar to that of the preceding problem for the case n — 3: 
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/ “11 

a 12 

* * ’ a ln ^ 





uA = (m 1( m 2 , .. u„) “ 21 

a 22 

' * * a 2n 





\ “nl 

a n2 

a nn j 

I 




— (^ 1^11 4 ^2®2i 4 * ■ ’ 4 


u x ai 2 + u 2 a 22 + • ■ 

’ ’ + • * 

- y Vi X CL\ n 4 l M> 2 Qt 2 n "b 

•' + w n a n „) 


Since the u { and a {j are nonnegative, the components of uA are also nonnegative. Thus we 
only need to show that the sum of the components of uA is 1: 


u x a u 4 u 2 ct\ 2 i + * • • + u n a nl 4 u x a 12 4 u 2 a 22 4 • • • 4 u n a n2 4 * • • 4 %a ln 4 u 2 a 2n 4 * * • 4 u n a nn 
= u^au 4 a 12 4 • • • 4 a ln ) 4 u 2 (a 2l 4 a 22 4 • * • 4 a 2n ) 4 ■ • • 4 u n (a nl 4 a n2 4 • ■ • 4 a nn ) 

= u x • 1 4 u 2 • 1 4 • • • 4 u n * 1 — u x 4 u 2 4 • * • 4 u n = 1 

7.10. Prove Theorem 7.2: If A and B are stochastic matrices, then the product AB is a 
stochastic matrix. Therefore, in particular, all powers A n are stochastic matrices. 

The ith-row of the product matrix AB is obtained by multiplying the tth-row r x of A by 
the matrix B: s { = r { B. Since each is a probability vector and B is a stochastic matrix, by 
the preceding problem, s { is also a probability vector. Hence AB is a stochastic matrix. 


7.11. Prove: Let p — (p u p 2 , . .., p m ) be a probability vector, and let T be a matrix 
whose rows are each the same vector t - (t u t 2 , ..., £m). Then pT -t. 

Using the fact that p x 4 p 2 4 • • • 4 p m = 1, we have 

h *2 * ’ * t m 

^2 * * ’ 

h h * ’ ‘ 

= (Pi*, 4 p 2 t x 4-h p m t l9 Pi*2 + P 2 t 2 4 • * ■ 4 P m t 2f ..., p x t m 4 P 2 t m 4 * • • 4 p m t m ) 

~ ((Pi + P2 +-t Pm)hy (Pi + P2 +-1- , (Pi + P 2 + ' * * + Pm)* m ) 

= l*t m ) = (t lt t 2 , . .., t m ) = t 


= (Pl> P 2 , • • •, Pm) 


REGULAR STOCHASTIC MATRICES AND FIXED PROBABILITY VECTORS 

7.12. Find the unique fixed probability vector of the regular stochastic matrix A = 

What matrix does A n approach? 

We seek a probability vector t = (x, 1 — x) such that tA — t : 

(x, 1 - x) ^ ^ = (#, 1 — x) 

Multiply the left side of the above matrix equation and then set corresponding components equal 
to each other to obtain the two equations 

^X 4 2 ^ 4*^ 4 g- -^x — 1 X 

Solve either equation to obtain x = Thus t = (-J, is the required probability vector. 

Check the answer by computing the product tA: 

(t>i4 5 1 

\2 i 

The answer checks since tA = t . 

The matrix A n approaches the matrix T whose rows are each the fixed point t: T = 
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7.13. 


7.14. 


(i) Show that the vector u — (ft, a) is a fixed point of the general 2x2 stochastic 
1 — a a \ 
b 1 -b)‘ 


matrix P = 


(ii) Use the result of (i) to find the unique fixed probability vector of each of the 
following matrices: 


(i) »P 


A 





.3 

.2 



(6 — ab + ab f ab + a — ab) — ( 6 , a) = u. 


(ii) By (i), u — (1, J) is a fixed point of A. Multiply u by 3 to obtain the fixed point (3,2) of A 
which has no fractions. Then multiply (3,2) by 1/(3 + 2) — to obtain the required unique 
fixed probability vector (-|, ^). 

By (i), u = (^, -J) is a fixed point of B . Multiply u by 6 to obtain the fixed point (4,3), 
and then multiply by 1/(4 + 3) = 1 to obtain the required unique fixed probability vector 

By (i), u = (. 8 , .3) is a fixed point of C. Hence (8,3) and the probability vector (^, ^) 
are also fixed points of C. 


Find the unique fixed probability vector of the regular stochastic matrix 

/* i 

P = * 0 i 

\0 1 0 , 


Method 1. We seek a probability vector t = (x,y, 1 — x — y) such that tP = t: 


4 i \\ 


(x, y,l-x-y)\± 0 1 

,0 1 Oy 


(x, y ,1 - x - 3 


Multiply the left side of the above matrix equation and then set corresponding components equal 
to each other to obtain the system of three equations 


i* + = * 

\x + l — x — y = y 
+ ^y = 1 — x — y 


x — y — 0 
3x + 8 y — 4 
5 x + 6 y = 4 


4 4 

Choose any two of the equations and solve for x and y to obtain x = ^ and 2/ = n* Check the 

3 


solution by substituting for x and y into the third equation. Since 1 — x — y = the required 
fixed probability vector is t = 

Method 2 . We seek any fixed vector u = (x, y, z) of the matrix P: 


(%, y, z) 


2 l 

2 0 2 I = 
\0 1 


(x 9 y, z) 


Multiply the left side of the above matrix equation and set corresponding components equal to each 
other to obtain the system of three equations 


iv = 




\x + z = y 
+ \v - z 


x — y — 0 
^ x — 4y + 4z = 0 
x + 2y — 4z = 0 
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We know that the system has a nonzero solution; hence we can arbitrarily assign a value to one 
of the unknowns. Set y — 4. Then by the first equation x = 4, and by the third equation z — 3. 
Thus u — (4,4,3) is a fixed point of P. Multiply u by 1/(4 + 4 + 3) = ji to obtain t = = 

(A, ±, A) which is a probability vector and is also a fixed point of P. 


7.15. Find the unique fixed probability vector of the regular stochastic matrix 

/0 1 0 \ 

/»=*** 

\0 § 1/ 

What matrix does P n approach? 

We first seek any fixed vector u = (x f y, z) of the matrix P: 

/ 0 1 0 ^ 

(X, VyZ) 



= (x, y, z) 


Multiply the left side of the above matrix equation and set corresponding components equal to 
each other to obtain the system of three equations 


\y - * 



r y = 6x 


' 

" * + \y + §* = y 

iy + i* = * 

or 


6a; + Sy + 4z = 6y 

1 y + z — 3 z 

or 

•< 

k 


y — 6x 
6x + 4z ~ 
y = 2z 


3 y 


We know that the system has a nonzero solution; hence we can arbitrarily assign a value to one 
of the unknowns. Set x — 1. Then by the first equation y = 6, and by the last equation z = 3. Thus 
u = (1,6,3) is a fixed point of P. Since 1 + 6 + 3 = 10, the vector t = (^, is the 

required unique fixed probability vector of P. 


P n approaches the matrix T whose rows are each the fixed point t: T ~ 


l 

6 

3 1 

10 

1 O 

10 

l 

TO 

6 

10 

A 

1 

6 

3 

,10 

10 

10, 


7.16. If t = is a fixed point of a stochastic matrix P, why is P not regular? 

If P i$ regular then, by Theorem 7.3, P has a unique fixed probability vector, and the 
components of the vector are positive. Since the components of the given fixed probability vector 
are not all positive, P cannot be regular. 


7.17. Which of the following stochastic matrices are regular? 


(i) A = 


(i) 

(ii) 


0 1 


(ii) B 


(iii) C 


4 4 ' 


0 10 

Vi i 0/ 


0 1\ 


(iv) D = 


JL 

4 


1 0 / 


Recall that a stochastic matrix is regular if a power of the matrix has only positive entries. 
A is not regular since there is a 1 on the main diagonal (in the second row). 


B 2 = 


P 3 = 


= the identity matrix I 


= B 


Thus every even power of B is the identity matrix I and every odd power of B is the matrix B. 
Accordingly every power of B has zero entries, and so B is not regular. 
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(iii) C is not regular since it has a 1 on the main diagonal. 


/« 

1 

°\ 

/* 

i 

i\ 

i 

V6 

A 

and D 3 = ^ 

tt 

1 3 
64 

\2 

i 

iJ 

u 

TS 

- 9 _/ 

16/ 


Since all the entries of D 3 are positive, D is regular. 


MARKOV CHAINS 

7.18. A student’s study habits are as follows. If he studies one night, he is 70% sure not 
to study the next night. On the other hand, if he does not study one night, he is 60% 
sure not to study the next night as well. In the long run, how often does he study? 


The states of the system are S (studying) and T (not studying). The transition matrix is 


P 


S 

T 




To discover what happens in the long run, we must find the unique fixed probability vector t of P. 
By Problem 7.13, u = (.4, .7) is a fixed point of P and so t = is the required proba¬ 

bility vector. Thus in the long run the student studies fr of the time. 


7.19. A psychologist makes the following assumptions concerning the behavior of mice 
subjected to a particular feeding schedule. For any particular trial 80% of the mice 
that went right on the previous experiment will go right on this trial, and 60% of 
those mice that went left on the previous experiment will go right on this trial. If 
50% went right on the first trial, what would he predict for (i) the second trial, 
(ii) the third trial, (iii) the thousandth trial? 

The states of the system are R (right) and L (left). The transition matrix is 

R L 


The probability distribution for the first trial is p = (.5, .5). To compute the probability 
distribution for the next step, i.e. the second trial, multiply p by the transition matrix P: 


(•5, .5) ("g ;*) = (.7,-3) 

Thus on the second trial he predicts that 70% of the mice will go right and 30% will go left. To 
compute the probability distribution for the third trial, multiply that of the second trial by P: 

(•7, .3) ("g ;*) = (.74, .26) 

Thus on the third trial he predicts that 74% of the mice will go right and 26% will go left. 


We assume that the probability distribution for the thousandth trial is essentially the 
stationary probability distribution of the Markov chain, i.e. the unique fixed probability vector t 
of the transition matrix P. By Problem 7.13, u — (.6, .2) is a fixed point of P and so 
t = (|, J) — (.75, .25). Thus he predicts that, on the thousandth trial, 75% of the mice will go to 
the right and 25% will go to the left. 
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7.20. Given the transition matrix P = 


1 0 

* * 


with initial probability distribution 


2> (0) = (£, f). Define" and find: (i) p (3) , (ii) p (3) , (iii) p C3> . 


(i) p c 2 3) i s the probability of moving from state a 2 state a t in 3 steps. It can be obtained from 

the 3-step transition matrix P 3 ; hence first compute P 3 : 


pa -- 


1 ° ) 

1 V’ 


pa = 


1 0 

A i 


Then jd ( 3) is the entry in the second row first column of P 3 : p (3) = Z 
21 21 8 


(ii) p (3) is the probability distribution of the system after three steps. It can be obtained by 

successively computing p (1) , p< 2 > and then p< 3) : 

pci) = pC0)p = (l jj) ^ ^ = (|, 4 ) 

p ca> = p«) P = (f,£)Q J) = (*.*) 

p(3) = p(S) P = = (H.A) 

However, since the 3-step transition matrix P 3 has already been computed in (i), p (3) can also 
be obtained as follows: 

pC3> = p(0) F 3 = = (i|, *) 

(iii) is the probability that the process is in the state a 2 after 3 steps; it is the second com¬ 
ponent of the 3-step probability distribution p (3) : p^ 3) — 


7.21. 


Given the transition matrix P — \h i 

\o 1 


0 and the initial probability distribution 

0/ 


p <0) = Find: (i) and pjf, (ii) p (4) and p ( 3 4) , (iii) the vector that p (0) P" 

approaches, (iv) the matrix that P n approaches. 


(i) First compute the 2-step transition matrix P 2 : 



Then p (2) = X and p (2) = 0, since these numbers refer to the entries in P 2 . 

r 32 2 13 


(ii) To compute p C4) , use the 2-step transition matrix P 2 and the initial probability distribution 

p(0) : 

p<2) = p (0)p2 = (i,|,0) and p< 4) = p (2) P 2 = (i> &. £) 

Since p (4) is the third component of p C4) , p (4) = 

(iii) By Theorem 7.3, p^P 71 approaches the unique fixed probability vector t of P. To obtain t, 
first find any fixed vector u = (#, y, z): 
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(1 

i 

i 

0 = (*, y, z) 

or 

-< 

Uv - * 
i* + + * = y 

\0 

1 

0/ 



|i* = 2 


Find any nonzero solution of the above system of equations. Set z — 1; then by the third 
equation x = 2, and by the first equation y = 4. Thus u = (2,4,1) is a fixed point of P and so 
t — (|, i). In other words, p (0) P n approaches (f, i). 


(iv) P n approaches the matrix T whose rows are each the fixed probability vector of P; hence 


P n approaches 




7.22. A salesman’s territory consists of three cities, A, B and C. He never sells in the 
same city on successive days. If he sells in city A, then the next day he sells in 
city B. However, if he sells in either B or C, then the next day he is twice as likely 
to sell in city A as in the other city. In the long run, how often does he sell in each 
of the cities? 


The transition matrix of the problem is as follows: 

ABC 
A f 0 1 0\ 

P = B | 0 * 

C \* i 0/ 

We seek the unique fixed probability vector t of the matrix P. First find any fixed vector 

« = (*, V, *)■' 

I 0 1 °\ 

(*» y> z ) f o i I = v, z) or 

\t i V 

Set, say, z = 1. Then by the third equation y — 3, and by the first equation x = Thus 
u = (|, 3, 1). Also 3 u — (8, 9, 3) is a fixed vector of P. Multiply 3 u by 1/(8 + 9 + 3) = ~ to 
obtain the required fixed probability vector t — (f, ~ (.40, .45, .15). Thus in the long 

run he sells 40% of the time in city A, 45% of the time in B and 15% of the time in C. 


f v + v z = * 

X ~h ±Z = y 

iv = z 


7.23. There are 2 white marbles in urn A and 3 red marbles in urn B. At each step of 
the process a marble is selected from each urn and the two marbles selected are 
interchanged. Let the state a { of the system be the number i of red marbles in urn A. 
(i) Find the transition matrix P. (ii) What is the probability that there are 2 red 
marbles in urn A after 3 steps? (iii) In the long run, what is the probability that 
there are 2 red marbles in urn A ? 

(i) There are three states, a 0 , a l and a^ described by the following diagrams: 


2 W I 

3 R 

1 W 

1 w 

2 R 

2 W 

1 


1 R 

2 R 


1 R 

A 

B 

A 

B 

A 

B 


® 0 


If the system is in state a 0 , then a white marble must be selected from urn A and a red 
marble from urn B, so the system must move to state a x . Accordingly, the first row of the 
transition matrix is (0,1, 0). 
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Suppose the system is in state It can move to state a Q if and only if a red marble is 
selected from urn A and a white marble from urn B\ the probability of that happening is 
~ h Thus p 10 = The system can move from state to a 2 if and only if a white 
marble is selected from urn A and a red marble from urn B\ the probability of that happening 
is l Thus p 12 = Accordingly, the probability that the system remains in state 

a x is pH = 1 — ^ — ^ = Thus the second row of the transition matrix is (Note 

that p n can also be obtained from the fact that the system remains in the state if either 
a white marble is drawn from each urn, probability • -J- = or a red marble is drawn 
from each urn, probability ^ thus p n = % + = ^.) 

Now suppose the system is in state a 2 . A red marble must be drawn from urn A. If a 
red marble is selected from urn B , probability then the system remains in state a 2 \ and if 
a white marble is selected from urn B } probability then the system moves to state a v Note 
that the system can never move from state a 2 to the state a 0 . Thus the third row of the transit 
tion matrix is (0, -J-). That is, 

& 0 ®1 

«o / 0 1 0 \ 

p = «1 i i i 

®2 f i / 


(ii) The system began in state a 0 , i.e. p <0) = (1,0,0). Thus: 


pOi> = p co)p = (o, 1, 0), 


= p (1) P = (bhi)’ P (3) = p mp = (^>14) 


Accordingly, the probability that there are 2 red marbles in urn A after 3 steps is 


(hi) 


We seek the unique fixed probability vector t of the transition matrix P. First find any fixed 
vector w = (x,y,z): 


(z, y, z) 


1° 

1 

°\ 



r,y = x 

u 

1 

2 

*] 

= (*, y, z) 

or < 

X + \y + f z = y 

\0 

f 

ii 



iv + i z = z 


Set, sayj, x = 1. Then by the first equation y = 6, and by the third equation z = 3. Hence 
u = (1,6, 3). Multiply u by 1/(1 + 6 4- 3) = ^ to obtain the required unique fixed proba¬ 
bility vector t = (.1, .6, .3). Thus, in the long run, 30% of the time there will be 2 red marbles 
in urn A . 

Note that the long run probability distribution is the same as if the five marbles were 
placed in an urn and 2 were selected at random to put into urn A. 


7.24. A player has $2. He bets $1 at a time and wins $1 with probability He stops play¬ 
ing if he loses the $2 or wins $4. (i) What is the probability that he has lost his money 
at the end of, at most, 5 plays? (ii) What is the probability that the game lasts 
more than 7 plays? 

This is a random walk with absorbing barriers at 0 and 6 (see Examples 7.22 and 7.23). The 
transition matrix is 



a-o 

a l 

a 2 

«3 

a 4 

«5 

a 6 

a 0 

/1 

0 

0 

0 

0 

0 

° \ 

ttl 

' i 

0 

i 

0 

0 

0 

° \ 

a 2 

0 

1 

0 

i 

0 

0 

0 

a s 

0 

0 

i 

0 

1 

2 

0 

0 

a 4 

0 

0 

0 

* 

0 

1 

2 

0 

a 5 

1 0 

0 

0 

0 

1 

2 

0 

* 

a 6 

\ 0 

0 

0 

0 

0 

0 

1 1 


with initial probability distribution p (0) = (0,0,1,0,0,0,0) since he began with $2. 
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(i) We seek p< 5) , the probability that the system is in state a 0 after five steps. Compute the 
5th step probability distribution p (5) : 

pd) = = (0, 4, 0, 0, 0, 0) p< 4 > - p«>P - (f, 0, JL, 0, i, 0, Jj) 

p( 2 ) = p(i)p = (J, 0 , i, 0 , i, 0 , 0 ) p< 5 > - = (§ 4 , 04 , 0 ,^) 

p(3) = p(2)p = (J, 0, f, 0, 0) 

Thus p^ 5) , the probability that he has no money after 5 plays, is 


(ii) Compute p (7) : p (G) = p (5) P 


9 0 ± 0 H 0 h 

4 ’ 3 2 * 64 * V > o'* P 


= P 




'64 » 64* 128 * 128 * 8 ' 


The probability that the game lasts more than 7 plays, i.e. that the system is not in state 
a o or a e after 7 steps, is £ + ^ ^ 


7.25. Consider repeated tosses of a fair die. Let X n be the maximum of the numbers 
occurring in the first n trials. 

(i) Find the transition matrix P of the Markov chain. Is the matrix regular? 

(ii) Find p (1) , the probability distribution after the first toss. 

(iii) Find p (2) and p (S \ 


(i) The state space of the Markov chain is (1, 2, 3, 4, 5, 6). The transition matrix is 


P 


1 2 3 4 5 6 


1 

2 

3 

4 

5 

6 



0 

0 


\ 


* 

0 

0 

0 

0 


iii 

ill 
6 6 6 

I i i 

0 ! i 

o o f 

0 0 0 



We obtain, for example, the third row of the matrix as follows. Suppose the system is in 
state 3, i.e. the maximum of the numbers occurring on the first n trials is 3. Then the 
system remains in state 3 if a 1, 2, or 3 occurs on the (n-fl)-st trial; hence p 33 = On the other 
hand, the system moves to state 4, 5 or 6, respectively, if a 4, 5 or 6 occurs on the (n + l)-st trial; 
hence p 34 — p 35 = p 36 = ^. The system can never move to state 1 or 2 since a 3 has occurred 
on one of the trials; hence p 31 = p 32 = 0- Thus the third row of the transition matrix is 
(0, 0, | , £). The other rows are obtained similarly. 

The matrix is not regular since state 6 is absorbing, i.e. there is a 1 on the main diagonal 
in row 6. 


(ii) On the first toss of the die, the state of the system is the number occurring; hence 

p (1) = <*. b b b b b- 


(iii) p (2) = p (i:> P 


(1 i 1 I i 11) „(8) = v mp = /J_ 7 iL^A) 

'36* 36* 36* 36 » 36 » 36'* ** H V 216 » 216 9 216 9 216 * 216 * 216 ; 


7.26. Two boys bi and b 2 and two girls gi and g 2 are throwing a ball from one to the other. 
Each boy throws the ball to the other boy with probability h and to each girl with 
probability 1 On the other hand, each girl throws the ball to each boy with 
probability | and never to the other girl. In the long run, how often does each 
receive the ball ? 

This is a Markov chain with state space (6,, b 2 , g lt g 2 } and transition matrix 

b l b 2 9 1 9 2 

b i /° i i i\ 

4 » i 

U i 0 0 

\i i 0 oj 


p 


ffl 
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We seek a fixed vector u ~ (x,y 9 z,w) of P: (x,y,z,w)P = (x,y>z,w). Set the corresponding 
components qf uP equal to u to obtain the system 

\y + p + 

\x + \z + 

i* + = * 

ix + \y = W 

We seek any nonzero solution. Set, say, z = 1; then w = 1, x — 2 and p = 2. Thus w = (2, 2,1,1) 

and so the unique fixed probability of P is t = (J, -J, J). Thus, in the long run, each boy 

receives the ball ^ of the time and each girl l of the time. 



7.27. Prove Theorem 7.6: Let P = (pa) be the transition matrix of a Markov chain. 
If V = (Pi) is the probability distribution of the system at some arbitrary time k, 
then pP is the probability distribution of the system one step later, i.e. at time k + 1; 
hence pP n is the probability distribution of the system n steps later, i.e. at time 
k + n. In particular, p cl) = p (0) P, p (2) = p cl) P, ... and also p (n) — p (0) P”. 

Suppose the state space is {<Xi, a 2 ,.. ., a m }. The probability that the system is in state a^ at 
time k and then in state a { at time h+ 1 is the product VjVjv Thus the probability that the system 
is in state at time k 4- 1 is the sum 

Til 

PlPli + P2P2i + ' ' • + PmPmi = 2 PjPjx 

3 = 1 

Thus the probability distribution at time k + 1 is 

( mm m \ 

2 PjPjU 2 PjPj2. •••» 2 PjPjm ) 

3=1 3=1 3=1 / 

However, this vector is precisely the product of the vector p — (p^ by the matrix P = (py): p* = pP. 


7.28. Prove Theorem 7.5: Let P be the transition matrix of a Markov chain. Then the 
%-step transition matrix is equal to the nth power of P: P Cn) = P n . 

Suppose the system is in state at, say, time /c. We seek the probability p<”> that the system 
is in state aj at time k + n. Now the probability distribution of the system at time k, since the 
system is in state a it is the vector e i — (0, ...,0,1,0, ,..,0) which has a 1 at the 'ith position 
and zeros everywhere else. By the preceding problem, the probability distribution at time k + n 
is the product e t P n . But e^P 71 is the ^th row of the matrix P n . Thus Pf. 7l) is the jth. component 
of the ith row of P n , and so P (n) = P n . 


MISCELLANEOUS PROBLEMS 

7.29. The transition probabilities of a Markov chain can be represented by a diagram, 
called a transition diagram , where a positive probability pa is denoted by an arrow 
from the state Oi to the state aj. Find the transition matrix of each of the following 
transition diagrams: 
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(i) Note first that the state space is {a 1 ,o 2 , a 3 } and so the transition matrix is of the form 


a l a 2 a s 



The 7 th row of the matrix is obtained by finding those arrows which emanate from in the 
diagram; the number attached to the arrow from to a 3 - is the ith component of the ith row. 
Thus the transition matrix is 




a 1 

a 2 

a 3 


a l 


0 

1 

p 

= a 2 

1 

0 

JL 

2 


a 3 

u 

0 

1 

2 


(ii) The state space is {a ly a 2) a 3 , u 4 }. The transition matrix is 

(Xj d 2 CE3 

«i /o i o i\ 

a 2 0 $ 0 i 

«i I 0 0 i 

a 4 \ 0 0 1 0 / 


7.30. Suppose the transition matrix of a Markov chain is as follows: 


P 


Is the Markov chain regular? 


ai a 2 


ai 

a 2 

a 3 

a 4 


/ 


i 


u 3 (X 4 
0 0\ 
0 0 


i 


Note that once the system enters the state or the state a 2 , then it can never move to 
state a 3 or state a 4 , i.e. the system remains in the state subspace { a lt a 2 }• Thus, in particular, 
pin) — 0 for every n and so every power P n will contain a zero entry. Hence P is not regular. 


7.31. Suppose m points on a circle are numbered respectively 1,2, .. .,ra in a counterclock¬ 
wise direction. A particle performs a “random walk” on the circle; it moves one 
step counterclockwise with probability p or one step clockwise with probability 
q = 1 — p. Find the transition matrix of this Markov chain. 


The state space is {1,2, . . ., m}. The diagram to the right below can be used to obtain the 
transition matrix which appears to the left below. 
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Supplementary Problems 


MATRIX MULTIPLICATION 

(1 -2 3\ 

7.32. Given A = I 4 1 — 1 ). 

\5 2 3/ 


Find uA if (i) u = (1, -3,2), (ii) u = (3,0, -2), (iii) u = (4, -1, -1). 


7.33. Given A 




Find AB and BA . 


7.34. Given A 



Find A 2 and A 3 . 


7.35. Given A 


fl 2\ 

„ ). Find A n . 

\° i/ 


PROBABILITY VECTORS AND STOCHASTIC MATRICES 

7.36. Which vectors are probability vectors? 


(i) (bh~bi) (ii) ,bi) (iii) ^hb^b- 


7.37. Find a scalar multiple of each vector which is a probability vector: 

(i) (3, 0,2, 5, 3) (ii) (2,£,0,£,f,0,l) (iii) (£, 2, 0,£, §). 

7.38. Which matrices are stochastic? 



REGULAR STOCHASTIC MATRICES AND FIXED PROBABILITY VECTORS 

7.39. Find the unique fixed probability vector of each matrix: 



/ 0 I i\ 

7.40. (i) Find the unique fixed probability vector t of P = -J- ^ 0 

\0 1 oy 

(ii) What matrix does P n approach? (iii) What vector does (J» J»£)F n approach? 


7.41. 


7.42. 


Find the unique fixed probability vector t of each matrix: 

/° i i\ /o 1 o\ 

d) A = U | 0 (ii) B = U 0 * 

\° 1 °/ U i i / 

(i) Find the unique fixed probability vector t of P = 

(ii) What matrix does P n approach? 

(iii) What vector does (£, 0, £)P n approach? 

(iv) What vector does (-J-, 0,0, ^ )P n approach? 



i 0\ 

0 i 

° l ' 

o $j 



CHAP. 7] 


MARKOV CHAINS 


149 


7.43. (i) Given that t = (^, 0, £,£) is a fixed point of a stochastic matrix P, is P regular? 

(ii) Given that t — (J, i, £) is a fixed point of a stochastic matrix P, is P regular? 

7.44. Which of the stochastic matrices are regular? 

fi i i\ fi i °\ /° 0 *\ 

(i) 0 1 0 (ii) U i 0 (iii) U 0 i 

\i 0 i! \H il \° 1 o/ 


7.45. 


Show that (c/ + ce + c?e, a/+6/+ae, + is a fixed point of the matrix 



MARKOV CHAINS 

7.46. A man’s smoking habits are as follows. If he smokes filter cigarettes one week, he switches to 
nonfilter cigarettes the next week with probability .2. On the other hand, if he smokes nonfilter 
cigarettes one week, there is a probability of .7 that he will smoke nonfilter cigarettes the next week 
as well. In the long run, how often does he smoke filter cigarettes? 

7.47. A gambler’s luck follows a pattern. If he wins a game, the probability of winning the next game 
is .6. However, if he loses a game, the probability of losing the next game is .7. There is an 
even chance that the gambler wins the first game. 

(i) What is the probability that he wins the second game? 

(ii) What is the probability that he wins the third game? 

(iii) In the long run, how often will he win? 


7.48. For a Markov chain, the transition matrix is P = L - ) with initial probability distribution 

\4 4 / 

p( 0 ) = (J,|). Find: (i) (ii) (iii) P (2) ; (iv) Pj 2) ; ( v ) the vector p (0) P n approaches; 

(vi) the matrix P n approaches. 

(\ 0 *\ 

7.49. For a Markov chain, the transition matrix is P = 1 0 0 and the initial probability dis- 

u * il 

tribution is p (0) = (^>^>0)- Find (i) p (ii) p<(iii) (iv) 

7.50. Each year a man trades his car for a new car. If he has a Buick, he trades it for a Plymouth. 

If he has a Plymouth, he trades it for a Ford. However, if he has a Ford, he is just as likely to 

trade it for a new Ford as to trade it for a Buick or a Plymouth. In 1955 he bought his first car 
which was a Ford. 

(i) Find the probability that he has a (a) 1957 Ford, (6) 1957 Buick, (c) 1958 Plymouth, 
(d) 1958 Ford. 

(ii) In the long run, how often will he have a Ford? 

7.51. There are 2 white marbles in urn A and 4 red marbles in urn B. At each step of the process a 

marble is selected from each urn, and the two marbles selected are interchanged. Let X n be the 
number of red marbles in urn A after n interchanges, (i) Find the transition matrix P. (ii) What 
is the probability that there are 2 red marbles in urn A after 3 steps? (iii) In the long run, what 

is the probability that there are 2 red marbles in urn A? 

7.52. Solve the preceding problem in the case that there are 3 white marbles in urn A and 3 red marbles 
in urn B. 
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7.53. A fair coin is tossed until 3 heads occur in a row. Let X n be the length of the sequence of heads 
ending at the nth trial. (See Example 7.24.) What is the probability that there are at least 
8 tosses of the coin? 

7.54. A player has 3 dollars. At each play of a game, he loses one 
dollar with probability J but wins two dollars with probability J. 

He stops playing if he has lost his 3 dollars or he has won at 
least 3 dollars. 

(i) Find the transition matrix of the Markov chain. 

(ii) What is the probability that there are at least 4 plays to 
the game? 


7.55. The diagram on the right shows four compartments with doors 
leading from one to another. A mouse in any compartment is 
equally likely to pass through each of the doors of the compart¬ 
ment. Find the transition matrix of the Markov chain. 



MISCELLANEOUS PROBLEMS 

7.56. Find the transition matrix corresponding to each transition diagram: 




7.57. Draw a transition diagram for each transition matrix: 






di 

a 2 

a 3 

«1 

(° 

1 

2 

i' 

2 

a 2 

l 

4 

1 

4 


a 3 

v° 

1 

2 

i t 


7.58. Consider the vector e t — (0, .. ., 0,1, 0, ..., 0) which has a 1 at the tth position and zeros elsewhere. 
Show that e { A is the ith. row of the matrix A (whenever the product is defined). 


Answers to Supplementary Problems 

7.32. (i) (-1, -1,12), (ii) (-7, -10,3), (iii) (-5, -11,10) 



7.36. Only (iii). 
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7.37. 

7.38. 

7.39. 

7.40. 

7.41. 

7.42. 

7.43. 

7.44. 

7.46. 

7.47. 

7.48. 

7.49. 

7.50. 

7.51. 

7.52. 

7.53. 


7.54. 


7.55. 

7.57. 
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P(n , r), 17 
Partitions, 5, 22, 56 
Pascal's triangle, 20 
Permutations, 16 
with repetitions, 17 
Poisson distribution, 108 
Positive integers, 2 
Power set, 5 
Preimage, 74 
Probability, 38, 40 
conditional, 54 
function, 40, 75 
product, 50 
vector, 127 

Process (stochastic), 55 
Product 

probability space, 50 
set, 4 

R (real numbers), 2 
Random events, 42 
Random variables, 74 
continuous, 84 
discrete, 83 
function of, 82 
independent, 81, 85 
Range, 74 
Real numbers, 2 
Regular stochastic matrix, 128 
Relative 
complement, 2 
frequency, 38 


Repeated trials, 58 
Row of a matrix, 126 

Sample 
mean, 87 
point, 38 
space, 38 
Sampling, 18 
Scalar multiplication, 126 
Set, 1 

Square matrix, 126 
Standard deviation, 78, 83, 85 
Standard 

normal distribution, 107 
units, 107 

Standardized random variable, 79 
State space, 130 
Stationary distribution, 133 
Stochastic 
matrix, 128 
process, 55 
Subset, 1 

Success (binomial distribution), 105 

Tchebycheff’s inequality, 86 
Techniques of counting, 16 
Transition 
diagram, 146 
matrix, 130 

Tree diagram, 9, 23, 55 
Trials, independent, 58, 68 

Uncountable sets, 4 
Uniform space, 42, 43 
Union of sets, 2 
Universal set, 1 

Variance, 78, 83, 84 
Vector, 126 
probability, 127 
Venn diagrams, 3 

Weighted average, 75 

Z (integers), 2 
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